aitrainer.work - AI Training Jobs Platform

Software Engineer – AI Evaluation Specialist

Alignerr Remote Posted 24 days ago

  • Education: Any
  • Pay Rate: $75/task

This position is hosted on an external talent platform. Please only apply for this position if it fits your skills and interests.

What You'll Do

  • Evaluate frontier AI language models on complex, real-world software engineering tasks
  • Identify bugs, logical errors, hallucinations, and reliability issues in AI-generated code
  • Design and review prompts, test cases, and evaluation scenarios that push models to their limits
  • Write precise, structured feedback that explains model strengths, weaknesses, and edge case behavior
  • Work across multiple languages and codebases to assess generalization, correctness, and robustness
  • Think like an adversary — find what breaks, not just what works

About the Role

What if your engineering instincts could directly influence how the world's most advanced AI systems write code for millions of developers? We're looking for experienced software engineers to evaluate frontier AI models — putting them through rigorous real-world scenarios, exposing their failure modes, and providing the expert feedback that makes them better. This is a fully remote, flexible contract role built for engineers who think critically, debug instinctively, and know the difference between code that looks right and code that actually is.

  • Organization: Alignerr
  • Type: Hourly Contract
  • Location: Remote
  • Commitment: 10–40 hours/week

Who You Are

  • 3–4+ years of professional software engineering experience
  • Strong proficiency in at least one of: TypeScript, Ruby, Java, or C++
  • Excellent written and spoken English — clear, precise communication is essential
  • A natural debugger — you thrive on finding non-obvious issues in complex systems
  • Comfortable with modern development tooling: Git, CLI workflows, testing frameworks
  • Able to critically evaluate AI behavior rather than simply accept model outputs at face value

Nice to Have

  • Familiarity with large language models, AI evaluation workflows, or prompt engineering
  • Experience writing test cases, test plans, or formal code reviews
  • Background in compiler design, static analysis, or systems programming
  • Exposure to multiple programming languages and paradigms

Why Join Us

  • Work on some of the most advanced AI systems in existence alongside leading research labs
  • Fully remote and asynchronous — work when and where it suits you
  • Freelance autonomy with meaningful, intellectually stimulating work
  • Make a direct and lasting impact on how AI understands and generates code
  • Potential for ongoing work and contract extension as new projects launch

Requirements

  • Fluency in English (written and verbal)
  • Reliable high-speed internet connection
  • Bachelor's degree or equivalent professional experience
  • Demonstrated expertise in Software Engineering

Eligible Languages

English

Frequently Asked Questions

What is the assessment actually like?

Notoriously strict. Alignerr uses TestGorilla for role-specific timed tests — a blank coding environment for engineers, rigorous grammar and fact-checking for writers. There is almost no hand-holding. The critical catch: this is essentially a one-shot process. Fail or abandon the assessment, and you are typically locked out of that role permanently with no option to retake.

How quickly can I start earning after I pass?

Not immediately. Even after passing the assessment and completing identity verification (via Persona) and billing setup (via Deel), you may sit in a waiting pool for weeks or months. You only start earning when a project matching your specific skills launches and you are officially assigned. Do not plan around Alignerr income until you are actively on a project.

Is there a community?

Yes — and it is one of Alignerr's genuine strengths. Once assigned to a project, you are added to Slack channels where you can ask questions, get rubric clarifications from admins, and talk to other AI trainers. This is rare in AI training and makes a real difference when guidelines are ambiguous or change mid-project.

What does the work actually look like?

It is practical, hands-on data work. You might record short videos, categorize images, rate text responses, or analyze data. Tasks are short and self-contained, typically 5 to 60 minutes each.

How flexible is the schedule?

Extremely. This is true "log in and work" flexibility. You can usually work anywhere from 20 minutes to 4 hours at a time, depending on your availability. Minimum hour requirements are rare, which makes it well suited to side income.

Is there an interview?

Usually, no. Hiring for these roles is almost entirely based on passing an automated assessment or "qualification" task. If you pass the test, you get access to the work.

What is the barrier to entry?

Alignerr is known for difficult technical assessments. You must pass a timed test in your specific domain (e.g., Python, Physics, or Language) before you are eligible for any paid projects.