AI Operations Specialist
🔒 Confidential Employer
Posted 13 August 2025
LOCATION
United Kingdom
TYPE
Full-time
LEVEL
Mid-Senior level
CATEGORY
Technology
This employer holds a UK Home Office sponsor license — sponsorship for this specific role is at the employer’s discretion
SKILLS
MLOps
Cloud Platforms
CI/CD
Docker
Kubernetes
Python
Bash
FULL DESCRIPTION
Summary
The AI Operations Specialist will manage the deployment and monitoring of AI/ML models, ensuring their reliability and performance. They will collaborate with data scientists and DevOps teams to streamline model pipelines, automate workflows, and implement best practices in AI environments.
Key Responsibilities/Duties:
- Managing the deployment and monitoring of AI/ML models in production environments
- Ensuring uptime, reliability, and performance of AI systems across platforms
- Collaborating with data scientists and DevOps teams to streamline model pipelines
- Automating workflows for model retraining, versioning, and data handling
- Monitoring system health and resolving issues related to data drift, performance, and latency
- Implementing operational best practices and compliance standards in AI environments
Core Requirements/Qualifications/Skills:
- Experience in AI/ML operations, MLOps, or DevOps in tech-driven environments
- Proficiency with cloud platforms (AWS, Azure, GCP), CI/CD, and containerization (Docker, Kubernetes)
- Familiarity with ML model lifecycle management tools (MLflow, SageMaker, Kubeflow, etc.)
- Strong scripting skills (Python, Bash) and comfort working in Linux-based systems
- Ability to troubleshoot model performance issues in real-time environments
- Degree in Computer Science, Data Engineering, AI, or related field preferred
🔧 What You’ll Be Working On:
- Managing the deployment and monitoring of AI/ML models in production environments
- Ensuring uptime, reliability, and performance of AI systems across platforms
- Collaborating with data scientists and DevOps teams to streamline model pipelines
- Automating workflows for model retraining, versioning, and data handling
- Monitoring system health and resolving issues related to data drift, performance, and latency
- Implementing operational best practices and compliance standards in AI environments
🎯 What We’re Looking For:
- Experience in AI/ML operations, MLOps, or DevOps in tech-driven environments
- Proficiency with cloud platforms (AWS, Azure, GCP), CI/CD, and containerization (Docker, Kubernetes)
- Familiarity with ML model lifecycle management tools (MLflow, SageMaker, Kubeflow, etc.)
- Strong scripting skills (Python, Bash) and comfort working in Linux-based systems
- Ability to troubleshoot model performance issues in real-time environments
- Degree in Computer Science, Data Engineering, AI, or related field preferred
Sign up free — access 45,000+ UK sponsor-licensed jobs