Role OverviewWe are looking for an MLOps Engineer with solid cloud experience to help build, deploy and maintain Machine Learning solutions in production. The right person combines strong engineering foundations with a passion for automation, cloud architectures and operational excellence.You will collaborate closely with Data Scientists, Data Engineers, Architects and Platform teams to ensure that AI models are delivered in a scalable, reproducible and secure way.Key Responsibilities
- Design, build and maintain end to end ML pipelines for training, validation and deployment
- Implement CI CD practices for the entire model lifecycle
- Automate workflows using tools such as Airflow, Kubeflow, MLflow or Vertex Pipelines
- Deploy models on cloud platforms like AWS Sagemaker, Azure ML or GCP Vertex AI
- Implement monitoring, observability and alerting for models in production
- Work with Data Scientists to optimise performance, datasets and reproducibility
- Manage infrastructure as code with Terraform or CloudFormation
- Ensure compliance with data governance, security and model versioning policies
- Optimise reliability, performance and cloud cost of ML platforms
- Document architectures, processes and best practicesRequirementsTechnical Skills
- 3 to 6 years of experience in Data Engineering, MLOps or similar roles
- Strong Python experience applied to ML or pipeline development
- Hands on experience with at least one major cloud provider : - AWS (Sagemaker, ECR, Lambda, Step Functions)- Azure (Azure ML, Databricks, AKS)- GCP (Vertex AI, Cloud Run, BigQuery)-
- Experience with Docker and Kubernetes-
- Experience with CI CD systems like GitHub Actions, GitLab CI or Azure DevOps-
- Knowledge of MLflow, DVC, Feast, Metaflow or similar tracking tools-
- Experience with Infrastructure as Code using Terraform or Pulumi-
- Understanding of ML models, pipelines and reproducibility principlesSoft Skills
- Strong engineering mindset and focus on automation
- Ability to collaborate with multidisciplinary teams
- Clear communication of technical concepts
- Proactive, reliable and quality focusedNice to Have
- Experience with RAG or GenAI workflows
- Experience with Spark or Databricks
- Experience with event driven systems such as Kafka or Pub Sub
- Background in security and data governance