Infrastructure Engineer
SKILLS
FULL DESCRIPTION
Infrastructure Engineer in Cambridge
Cambridge
Full-Time
28800 - 48000 £ / year (est.)
No home office possible
At a Glance
- Tasks: Manage and optimise our cutting-edge infrastructure for AI model deployment.
- Company: Join [Employer hidden — view at passion-project.co.uk], a leader in Speech Intelligence and AI technology.
- Benefits: Flexible working, generous holiday allowance, and career development support.
- Why this job: Be part of a team that drives innovation in AI and speech technology.
- Qualifications: Experience in data centres, Linux troubleshooting, and configuration management tools.
- Other info: Diverse and inclusive workplace with opportunities for global collaboration.
The predicted salary is between 28800 - 48000 £ per year.
We are seeking an Infrastructure Engineer to help ensure our internal infrastructure is running smoothly and enabling our team to train, build, test and release our AI models to customers quickly.
What You’ll Be Doing
- Leading the end-to-end hardware lifecycle of networking, CPU compute and GPU training hardware, such as the Nvidia DGX platform.
- Managing data centre operations day‑to‑day, ensuring that all maintenance tasks are performed proactively, including hardware monitoring and troubleshooting, disaster recovery testing and generally keeping our hardware up and running.
- Working with PXE booting to provision nodes, Ansible for configuration, Terraform for VMware deployments and Helm for Kubernetes deployments.
- Troubleshooting systems using observability tooling and your knowledge of Linux.
- Configuring our network switch devices, ensuring DNS/DHCP/IPAM is maintained.
- Vendor coordination and collaboration – you’ll lead relationships with hardware vendors, managed service providers, Smart Hands and other external contacts to drive improvements to our infrastructure.
Who We Are Looking For
- Experience working in a data centre environment.
- Experience with networking configuration and troubleshooting.
- Comfortable troubleshooting Linux systems – we use Ubuntu.
- Experience with configuration management tooling – we use Terraform and Ansible a lot.
- Experience with bare metal provisioning – we use PXE booting to automate hardware OS installs.
- Broad knowledge of the infrastructure that supports an application in a distributed system – e.g. operating systems, observability tooling, networking, storage, databases, monitoring, cloud providers, containers, container orchestration tools.
- Document your actions so you and others can repeat them and eventually automate them.
- Love to go deep into any subject when required as you love learning on the job.
- Excellent written and verbal communication – you know when to go synchronous and when to go async.
Who We Are
[Employer hidden] is the leading expert in Speech Intelligence, and uses AI and Machine Learning to unlock business value in human speech worldwide. We work with an amazing mix of global companies, and our technology can integrate into our customers stack irrespective of their industry or use case – making it the go‑to solution to harness useful information from speech.
What We Can Offer You
No matter what stage of your career you’re at – from paid internships and first‑job opportunities through to management and senior positions – we’ll support you with the training and development needed to reach your career aspirations with us. We offer flexible working, regular company lunches, birthday celebrations, private medical and dental, global working opportunities, a generous holiday allowance and pension/401K matching, a working‑from‑home allowance for tech or home office equipment, and supportive parental leave.
At [Employer hidden], our mission is simple: Understand Every Voice out there. We welcome different experiences, viewpoints, and identities and actively celebrate and support everyone – regardless of gender, race, disability, age, sexual orientation, religion, marital status, national origin, veteran status, pregnancy or maternity status.