Senior Machine Learning Engineer

🔒 Confidential Employer
Posted 23 April 2026
LOCATION
United Kingdom
TYPE
Full-time
LEVEL
Mid-Senior level
CATEGORY
Technology
This employer holds a UK Home Office sponsor license — sponsorship for this specific role is at the employer’s discretion

SKILLS

Python PyTorch Diffusion models LLMs Fine-tuning GPU Model evaluation ML systems

FULL DESCRIPTION

Join [Employer hidden — view at passion-project.co.uk] as a Senior Machine Learning Engineer and be at the forefront of developing innovative AI solutions across various media modalities including text, image, video, 3D, and audio. We're building a powerful AI media creation platform designed to revolutionize how content is generated.

What You'll Be Doing

  • Integrate open-source and third-party models into our inference platform
  • Lead fine-tuning initiatives (LoRA, adapters, PEFT, domain adaptation)
  • Optimise inference workloads for latency, batching, memory efficiency, and throughput
  • Benchmark model quality vs cost vs performance across modalities
  • Improve inference startup times and stability under high load
  • Build evaluation frameworks and internal tooling for model validation
  • Work closely with Infrastructure and Backend teams on scalable serving systems
  • Monitor production performance and drive continuous optimisation
  • Mentor engineers and help raise the ML engineering bar across the team

What We’re Looking For

  • Proven experience delivering ML systems to production environments
  • Strong, low-level Python skills and deep hands-on experience with PyTorch
  • Experience working with diffusion models, LLMs, or multimodal architectures
  • Practical experience fine-tuning large models (LoRA, PEFT, adapters, etc.)
  • Experience optimizing inference workloads in GPU environments
  • Strong understanding of model evaluation, experimentation, and monitoring
  • Ability to debug performance, memory, and reliability issues in production
  • Strong systems thinking understanding how ML decisions impact infrastructure
  • High ownership and comfort operating in a fast-paced startup environment

Nice to have

  • Experience with vLLM or custom inference servers
  • Experience with Kubernetes or containerised ML workloads
  • Experience working in high-throughput distributed systems
  • Background in AI media generation (image, video, audio)
  • Experience building internal ML tooling or developer-facing APIs
  • Experience with kernels in CUDA/C++
Sign up free — access 45,000+ UK sponsor-licensed jobs