We are seeking a Senior DevOps Engineer to support the development and scaling of our medical image analysis tools, focusing on creating compliant machine learning products. This role offers the opportunity to drive innovation in internal ML tooling while ensuring robust and efficient deployment practices.
Responsibilities
- Design, implement and maintain internal ML tools: add features, fix bugs, raise test coverage and improve documentation
- Deploy and maintain complex data pipelines for large image analysis workflows
- Evaluate and integrate cloud computing platforms in collaboration with IT and development teams
- Define workflows, evaluate tooling and infrastructure, and assist users in test scenarios
- Ensure that processes and the tools developed meet compliance requirements
Requirements
- 3+ years of experience as a DevOps engineer or in a similar role
- Expertise in building ML systems with frameworks like PyTorch, TensorFlow or JAX
- Competency in developing training data processing pipelines for image data
- Proficiency across the MLOps lifecycle, including model training, validation, deployment and quality monitoring
- Ability to write high-performance, modern and well-documented Python code with strong unit-test coverage
- Familiarity with Linux environments, Git-based workflows, GitHub Actions, containerization with Docker and orchestration with Kubernetes
- Experience with MLOps platforms such as W&B and MLflow, and with deep learning tools such as PyTorch Lightning
Nice to have
- Understanding of TypeScript, Node.js or JavaScript
- Background in workflow processing platforms such as Argo Workflows or Kubeflow
- Expertise in computer vision, including experience with OpenCV
- Familiarity with distributed microservice architectures
- Experience working with cloud infrastructure, particularly AWS