Infrastructure Engineer

Techruiter

We aim to address the shortage of skilled labour in critical domains such as healthcare, by
creating AI assistants that can perform tasks that have typically required human-level intelligence.
 
We are looking for a motivated Infrastructure Engineer to work with us on developing leading
edge LLM and knowledge based systems.
 
About You
 
As an Infrastructure Engineer, you will play a critical role in designing, implementing, and
maintaining our cloud-based infrastructure. You will work closely with our machine learning
and software development teams to ensure that our platforms are scalable, secure, and
optimised for performance.
You may succeed in this role if you
  • Possess 3+ years of experience setting up and managing cloud infrastructure on AWS or other cloud providers
  • Are proficient in Python for scripting and automation
  • Have a strong understanding of security best practices and tightening infrastructure for highly secure cloud operations
  • Understand infrastructure-as-code (IaC) tools and CI/CD best practices
  • Have strong communication and collaboration skills, with the ability to work effectively in a remote or hybrid team environment
  • Additionally, the following would be considered advantageous:
  • Operating infrastructure for ML / AI applications at scale
  • Previous experience in an early-stage startup
  • Prior experience working with healthcare data and ensuring data privacy and regulatory compliance
Key Responsibilities
  • Design, implement, and maintain scalable and secure AWS infrastructure to support AI/ML applications and services
  • Develop and maintain infrastructure-as-code (IaC) using tools such as AWS CloudFormation or Terraform
  • Automate deployment, monitoring, and management processes using Python and relevant AWS services (e.g., Lambda, EC2, S3, RDS)
  • Optimise cloud infrastructure for performance, cost-efficiency, and reliability
  • Implement and manage CI/CD pipelines to streamline development and deployment
  • Monitor system performance, troubleshoot issues, and ensure high availability and disaster recovery capabilities
  • Stay up-to-date with the latest AWS services, best practices, and industry trends to continuously improve our infrastructure
  • Provide technical support and guidance to team members on cloud infrastructure and automation topics

Source
remotive.com