aitrainer.work - AI Training Jobs Platform
Software Engineering

Senior Software Engineer – LLM Evaluation

Turing Remote Posted 23 days ago

Education

Any

Type

Pay Rate

$54/task

Posted

23d ago

Apply Now

Role Overview

As a Software Engineering evaluator, you will create cutting-edge datasets for training, benchmarking, and advancing large language models, collaborating closely with researchers. This includes curating code examples, providing precise solutions, and making corrections in Python, JavaScript (including ReactJS), C/C++, Java, Rust, and Go; evaluating and refining AI-generated code for efficiency, scalability, and reliability; and working with cross-functional teams to enhance enterprise-level AI-d

Requirements

  • • Several years of software engineering experience, including 2+ years of continuous full-time experience at a top-tier product company (e.g., Google, Stripe, Amazon, Apple, Meta, Netflix, Microsoft, Datadog, Dropbox, Shopify, PayPal, IBM Research).
  • • Strong expertise in building full-stack applications and deploying scalable, production-grade software using modern languages and tools.
  • • Deep understanding of software architecture, design, development, debugging, and code quality/review assessment.
  • • Excellent oral and written communication skills for clear, structured evaluation rationales.
  • Must be eligible to work in Remote

Compensation Analysis

Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports customers in two ways: first, by accelerating frontier research with high-quality data, advanced training pipelines, plus top AI researchers who speci

Related Jobs

Browse All Jobs from Turing

Discover more opportunities on Turing that match your skills and interests.

View All Turing Jobs →

Frequently Asked Questions

How do I get started?

Review the hiring process above and follow the steps outlined. Each platform has different requirements, so make sure you meet the eligibility criteria before applying.

Is this work legitimate?

Yes. All platforms listed on aitrainer.work are legitimate and pay reliably. However, always verify payment methods and read reviews before committing significant time.

What does the work actually look like?

It is practical, hands-on data work. You might be recording short videos, categorizing images, rating text responses, or analyzing data. The tasks are designed to be short and distinct—typically 5-60 minutes per task.

How flexible is the schedule?

Extremely. This is true "log in and work" flexibility. You can usually work for 20 minutes or 4 hours depending on your availability. There are rarely minimum hour requirements, making it ideal for side income.

Is there an interview?

Usually, no. Hiring for these roles is almost entirely based on passing an automated assessment or "qualification" task. If you pass the test, you get access to the work.