AI Training Data Specialist

🔒 Confidential Employer
Posted 13 August 2025
Location: Remote
Type: Full-time
Level: Associate
Salary: £70,000 / year
Category: Technology
This employer holds a UK Home Office sponsor license — sponsorship for this specific role is at the employer’s discretion

SKILLS

Data Preprocessing, Python, Pandas, SQL, Machine Learning Workflows, Data Annotation, NumPy

FULL DESCRIPTION

Summary

A confidential employer is seeking an AI Training Data Specialist to collect, curate, and preprocess data for training AI models. The role involves identifying and addressing dataset biases, collaborating with data scientists, implementing quality assurance processes, and developing data labeling tools.

Key Responsibilities:

  • Collect, curate, and preprocess data for training AI models.
  • Identify and address biases or inconsistencies in datasets.
  • Collaborate with data scientists to define data requirements and formats.
  • Implement quality assurance processes to validate data integrity.
  • Develop and maintain tools for efficient data labeling and annotation.

Core Requirements:

  • Strong knowledge of data preprocessing and annotation techniques.
  • Experience with Python and data manipulation libraries (e.g., Pandas, NumPy).
  • Familiarity with database management and querying (SQL, NoSQL).
  • Understanding of machine learning workflows and training pipelines.
  • Bachelor’s degree in Data Science, Computer Science, or related field.
