Senior Simulation Data Engineer

🔒 Confidential Employer
Posted 8 May 2026
LOCATION
London
TYPE
Full-time
LEVEL
Mid-Senior level
CATEGORY
Data & Analytics
This employer holds a UK Home Office sponsor license — sponsorship for this specific role is at the employer’s discretion

SKILLS

Python Kubernetes SLURM CFD Data Pipeline Engineering Cloud Infrastructure Geometry Processing HDF5

FULL DESCRIPTION

Senior Simulation Data Engineer

[Employer hidden — sign up to reveal] is a deep-tech company with roots in numerical physics and Formula One, dedicated to accelerating hardware innovation at the speed of software. We are building an AI-driven simulation software stack for engineering and manufacturing across advanced industries. This role sits at the intersection of HPC engineering and data engineering, orchestrating long-running CFD simulations at scale.

Location: London, United Kingdom

Work Type: Hybrid (Shoreditch office and work-from-home)

The Role

The Senior Simulation Data Engineer will extend and operate the infrastructure that powers our research Data Factory. Responsibilities include geometry preparation, simulation orchestration, validation, post-processing, and delivery to downstream ML training systems.

What You Will Do

  • Extend and operate the Data Factory infrastructure orchestrating thousands of CFD simulations per day on cloud compute
  • Design job scheduling systems maximizing throughput while handling failures gracefully
  • Build monitoring and alerting for simulation failures, convergence issues, and resource bottlenecks
  • Build high-performance data pipelines moving simulation outputs to ML-ready training data
  • Implement geometry preprocessing workflows (mesh preparation, morphing, watertightness validation)
  • Design and operate post-processing pipelines: surface decimation, field interpolation, format conversion
  • Optimize I/O performance for large mesh datasets
  • Implement comprehensive validation checks at every pipeline stage
  • Build systems to capture and quarantine bad data before they reach training pipelines
  • Deliver validated datasets to ML training infrastructure in optimized formats

What You Bring to the Table

  • 5+ years of experience in data engineering, HPC engineering, or simulation infrastructure
  • Strong experience with orchestration systems: SLURM, Kubernetes, Temporal
  • Production data pipeline experience with large volumes of data
  • Proficiency in Python for pipeline development and automation
  • Systems engineering fundamentals: Linux, networking, storage systems, performance debugging
  • Experience with cloud infrastructure; ideally CoreWeave or similar GPU/HPC-focused clouds
  • Background in HPC for simulation engineering (CFD, FEA, etc.)
  • Experience with geometry processing: mesh manipulation, CAD formats, PyVista
  • Familiarity with scientific data formats: HDF5, VTK, NetCDF, Zarr
  • Data quality engineering experience: validation frameworks, anomaly detection, data observability

What We Offer

Join [Employer hidden — sign up to reveal] and build what actually matters. Work with a high-caliber team, enjoy hybrid flexibility, equity options, 10% employer pension contribution, free office lunches, enhanced parental leave, 25 days annual leave, private medical insurance, Wellhub subscription, and more.

Apply via the [Employer hidden — sign up to reveal] careers page at [Employer hidden].ai/careers.

Sign up free — access 45,000+ UK sponsor-licensed jobs