aitrainer.work - AI Training Jobs Platform
Software Engineering mercor

STEM PhD Experts

Mercor Remote Posted 1 days ago

Education

Any

Type

Pay Rate

$85/task

Posted

1d ago

✅ Applying through this link gives you a verified candidate referral.

Referrals from verified candidates give your profile a visibility boost and help support our platform at no cost to you.

This position is hosted on an external talent platform. Please only apply for this position if it fits your skills and interests.

Apply Now

Applying to Mercor?

We support strong candidates applying here. Set up your talent profile so we know who you are.

Set up your profile →

About this Role

Join a leading AI lab's cutting-edge research team to be at the core of the AI revolution, where your expertise fuels the development of the most advanced LLMs.

We are seeking advanced STEM Researchers and PhD-level subject-matter experts (SMEs) to contribute to a project supporting a frontier-model evaluation effort focused on rigorous scientific and technical reasoning. The AI lab is building next-generation models capable of solving complex, research-grade problems across the sciences, and requires deep domain expertise to design, solve, and evaluate the challenging tasks that train and benchmark these systems.

Location Requirements

requiring a commitment of 40 hours per week (during weekdays)

About Cincinnatus LLC

Equal Employment Opportunity

Join a leading AI lab's cutting-edge research team to be at the core of the AI revolution, where your expertise fuels the development of the most advanced LLMs. We are seeking advanced STEM Researchers and PhD-level subject-matter experts (SMEs) to contribute to a project supporting a frontier-model evaluation effort focused on rigorous scientific and technical reasoning. The AI lab is building next-generation models capable of solving complex, research-grade problems across the sciences, and requires deep domain expertise to design, solve, and evaluate the challenging tasks that train and benchmark these systems. This is a W-2 employment position with Cincinnatus LLC, requiring a commitment of 40 hours per week (during weekdays). This position will be placed at a leading AI Lab as part of their extended workforce. Guide research teams to close knowledge gaps in STEM domains by surfacing edge cases, ambiguities, and frontier problems where current models underperform. Design challenging, rigorous domain tasks and write accurate, well-reasoned solutions that demonstrate expert-level scientific and technical reasoning. Evaluate tasks and solutions produced by AI agents and other contributors, providing clear written technical feedback grounded in domain expertise. Develop evaluation frameworks and rubrics for assessing scientific reasoning quality across STEM domains. Collaborate with other subject matter experts to ensure consistency and accuracy in training data. PhD (completed, enrolled, or equivalent research track) in Physics, Chemistry, Biology, Mathematics, Statistics, Computer Science, Electrical Engineering, Mechanical Engineering, Civil Engineering, Materials Science, or another STEM discipline. 3+ years of research, academic, or industry experience in their primary STEM domain. Demonstrated technical expertise in at least one domain: computational modeling, laboratory methods, data analysis, statistical inference, programming, or equivalent scientific methods. Ability to commit to 40 hours per week during weekdays for the duration of the engagement. Prior experience with data annotation, labeling, evaluation, or human feedback collection is a strong plus. Experience with LLMs, AI systems, or agentic workflows; familiarity with agentic frameworks is a plus. Strong written communication skills; ability to explain complex scientific or technical concepts clearly in writing. About Cincinnatus LLC: Cincinnatus LLC is an enterprise staffing company that partners with leading technology companies to source and employ highly skilled professionals for contingent and contract-based opportunities. Cincinnatus serves as the employer of record for these engagements, providing W-2 employment, payroll, benefits, and compliance, while placing employees directly within client teams to work on high-impact initiatives. Equal Employment Opportunity: Cincinnatus is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or any other legally protected characteristic. We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

  • Guide research teams to close knowledge gaps in STEM domains by surfacing edge cases, ambiguities, and frontier problems where current models underperform.
  • Design challenging, rigorous domain tasks and write accurate, well-reasoned solutions that demonstrate expert-level scientific and technical reasoning.
  • Evaluate tasks and solutions produced by AI agents and other contributors, providing clear written technical feedback grounded in domain expertise.
  • Develop evaluation frameworks and rubrics for assessing scientific reasoning quality across STEM domains.
  • Collaborate with other subject matter experts to ensure consistency and accuracy in training data.
  • PhD (completed, enrolled, or equivalent research track) in Physics, Chemistry, Biology, Mathematics, Statistics, Computer Science, Electrical Engineering, Mechanical Engineering, Civil Engineering, Materials Science, or another STEM discipline.
  • 3+ years of research, academic, or industry experience in their primary STEM domain.
  • Demonstrated technical expertise in at least one domain: computational modeling, laboratory methods, data analysis, statistical inference, programming, or equivalent scientific methods.
  • Ability to commit to 40 hours per week during weekdays for the duration of the engagement.
  • Prior experience with data annotation, labeling, evaluation, or human feedback collection is a strong plus.
  • Experience with LLMs, AI systems, or agentic workflows; familiarity with agentic frameworks is a plus.
  • Strong written communication skills; ability to explain complex scientific or technical concepts clearly in writing.

Requirements

  • Must be eligible to work in Remote
  • Fluent proficiency in English (Written & Verbal)
  • Reliable high-speed internet connection
  • Bachelor's degree or equivalent professional experience
  • Demonstrated expertise in Software Engineering

Compensation Analysis

Monetize your niche expertise without the billable hours. At $85/hr, this role offers elite compensation for pure intellectual work—no client management or administrative bloat.

Skills & Categories

Explore other opportunities in related specializations:

Related Jobs

Mercor

Browse All Jobs from Mercor

Discover more opportunities on Mercor that match your skills and interests.

View All Mercor Jobs →

Community Reviews

Loading reviews…
💬

Share your experience with Mercor

Help other candidates make better decisions by leaving a review.

Sign in to leave a review

Frequently Asked Questions

Is this for freelancers or full-time employees?

Both. Mercor tries to match you with clients who want long-term contractors. Unlike other platforms where you log in and grab small tasks, Mercor matches you with one company for a steady role (e.g., 'Python Tutor for 3 months').

I'm not comfortable on camera. Can I still apply?

No. The application requires a video interview with an AI avatar. The AI asks you questions about your resume, and the video is shared with potential clients to prove your communication skills.

Does it cost money to join?

No. You should never pay to join these platforms. Mercor makes money by charging the client a fee on top of your hourly rate.

What does the work actually look like?

It is practical, hands-on data work. You might be recording short videos, categorizing images, rating text responses, or analyzing data. The tasks are designed to be short and distinct—typically 5-60 minutes per task.

How flexible is the schedule?

Extremely. This is true "log in and work" flexibility. You can usually work for 20 minutes or 4 hours depending on your availability. There are rarely minimum hour requirements, making it ideal for side income.

Is there an interview?

Usually, no. Hiring for these roles is almost entirely based on passing an automated assessment or "qualification" task. If you pass the test, you get access to the work.

How soon will I start?

Important: Mercor is a talent marketplace, not a task queue. Applying puts you in a pool of candidates. You will only start working when a specific client (like a major AI lab) selects your profile. This matching process can take weeks.