Scaled Cognition is focused on ensuring the quality of AI models. As an AI Quality Assurance Engineer (Multilingual), you will be responsible for inspecting and grading training data, maintaining development environments, and collaborating with engineering teams to enhance data quality and pipelines.
Responsibilities:
- Meticulously inspect, review, and grade LLM training data, evaluation test cases, and model outputs to ensure maximum quality and accuracy
- Maintain local development environments to run test pipelines, investigate edge cases, and submit PRs via Git/GitHub to update our training repositories
- Act as a technical data detective, diving deep into training data to spot error cases
- Leverage LLMs as internal tools to translate, verify, and maintain our cross-lingual datasets
- Collaborate closely with the engineering team to refine our evaluation criteria and improve our data pipelines