Mathematics Expert (Competition & Olympiad-Level) - AI Training & Evaluation
Turing • Remote • Posted 4 days ago
Education
Any
Type
Pay Rate
$50/task
Posted
4d ago
✅ Applying through this link gives you a verified candidate referral.
Referrals from verified candidates give your profile a visibility boost and help support our platform at no cost to you.
This position is hosted on an external talent platform. Please only apply for this position if it fits your skills and interests.
Applying to Turing?
We support strong candidates applying here. Set up your talent profile so we know who you are.
Set up your profile →About this Role
About Turing
Based in San Francisco, California, Turing is the world's leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports customers in two ways: first, by accelerating frontier research with high-quality data, advanced training pipelines, and top AI researchers who specialize in coding, reasoning, STEM, multilinguality, multimodality, and agents; and second, by applying that expertise to help enterprises transform AI from proof of concept into proprietary intelligence with systems that perform reliably, deliver measurable impact, and produce lasting results on the P&L.
Role Overview
In this role, you will work on projects that improve and evaluate large language models by crafting challenging, competition-level mathematics problems and rigorously assessing model reasoning. The ideal candidate has a strong foundation in competitive mathematics at the AIME, HMMT, and IMO (Olympiad) level across the four classic pillars: Algebra, Number Theory, Combinatorics, and Geometry. You should be able to design novel, "Google-proof" problems intended to expose deep reasoning deficiencies in state-of-the-art models, and to diagnose precisely where and why a model's reasoning breaks down. The role combines original problem authoring, rigorous solution writing, and detailed evaluation of model-generated responses. This is your chance to future-proof your career in an AI-first world by working at the frontier of mathematical reasoning evaluation.
What does the day-to-day look like:
Design original, challenging mathematics problems at AIME, HMMT, and IMO difficulty that test the reasoning limits of large language models in multi-step, abstract settings, drawn strictly from Algebra, Number Theory, Combinatorics, or Geometry. Author novel prompts that "break" evaluated models, meaning the model arrives at an incorrect final answer; ensure problems cannot be bypassed via brute-force or computationally intensive methods. Solve problems independently and write detailed, logically structured, self-contained solutions with clear justifications, properly rendered in LaTeX. Review model-generated solutions, identify mathematical errors, logical fallacies, or missing arguments, and diagnose the root cause using defined failure categories (Final Answer, Reasoning Steps, Instruction Following). Contribute to defining new evaluation benchmarks across competition and Olympiad-level mathematics curricula. Classify each prompt accurately by domain, sub-domain, topic, and proficiency level within the labeling tool.
Requirements
Mathematical Expertise: Strong command of competitive mathematics at the level of AIME, HMMT, and IMO across Algebra, Number Theory, Combinatorics, and Geometry. Writing Proficiency: Excellent structured written communication, including fluency with standard LaTeX delimiters for all mathematical expressions. Analytical Skills: Strong research and analytical skills, with the ability to construct rigorous, proof-based reasoning. Creative Thinking: Creative and lateral thinking abilities to design novel problems that are not adapted from existing competitions or online repositories. Feedback Skills: Ability to provide constructive feedback, precise annotations, and accurate error diagnosis on model outputs. Independence: Self-motivated and able to work independently in a remote setting. Technical Setup: Desktop/Laptop setup with a good internet connection.
Preferred Qualifications
Candidates pursuing or holding a Bachelor’s/Master's degree in Mathematics, Applied Mathematics, Statistics, Engineering, or a related field are eligible and encouraged to apply. Prior experience in competitive mathematics (e.g., national or international Olympiads or equivalent competitive examinations) as a participant, coach, or problem setter is a bonus. Ability to analyze and solve complex problems with a structured, logical approach and to express solutions clearly and rigorously.
Perks of Freelancing With Turing
Work in a fully remote environment. Opportunity to work on cutting-edge AI projects with leading LLM companies. Potential for contract extension based on performance and project needs.
Offer Details
Commitments Required: At least 8 hours per day and 40 hours per week, with 4 hours of overlap with PST. Engagement type: Contractor assignment/freelancer (no medical/paid leave).
Requirements
- Must be eligible to work in Remote
- Fluent proficiency in English (Written & Verbal)
- Reliable high-speed internet connection
- Bachelor's degree or equivalent professional experience
- Demonstrated expertise in Mathematics
Compensation Analysis
Shape the "brain" of future AI. By working as a Mathematics Expert (Competition & Olympiad-Level) - AI Training & Evaluation, you ensure that future models understand the nuance of your field. At $50/hr, it's a lucrative way to preserve the integrity of your profession in the digital age.
Skills & Categories
Explore other opportunities in related specializations:
Related Jobs
Browse All Jobs from Turing
Discover more opportunities on Turing that match your skills and interests.
View All Turing Jobs →Community Reviews
Share your experience with Turing
Help other candidates make better decisions by leaving a review.
Sign in to leave a reviewLeave your review
Frequently Asked Questions
Do I need to be a software engineer?
Not anymore. Turing built its reputation matching senior engineers with Silicon Valley companies, but they have heavily pivoted into AGI infrastructure. They now hire non-engineering domain experts, technical writers, and researchers for post-training data annotation and RLHF. A strong analytical background and excellent English are required, but you do not need to code.
How does matching work?
Turing calls it the 'Intelligent Talent Cloud.' You build a profile and go through deep vetting — automated tests, an AI-powered interview, and practical skill assessments. Once vetted, Turing's algorithm automatically surfaces you to partner companies (Fortune 500s and top AI labs). You don't browse job boards or bid on work — matches come to you.
How does payment work?
You are hired as an independent contractor, responsible for your own local taxes. Turing collects payment from the client and pays you monthly in USD via Deel, Payoneer, or direct bank/wire transfer. Monthly pay is standard for long-term contract roles — if you need weekly cash flow, this structure requires adjustment.
What does the work actually look like?
It is practical, hands-on data work. You might be recording short videos, categorizing images, rating text responses, or analyzing data. The tasks are designed to be short and distinct—typically 5-60 minutes per task.
How flexible is the schedule?
Extremely. This is true "log in and work" flexibility. You can usually work for 20 minutes or 4 hours depending on your availability. There are rarely minimum hour requirements, making it ideal for side income.
Is there an interview?
Usually, no. Hiring for these roles is almost entirely based on passing an automated assessment or "qualification" task. If you pass the test, you get access to the work.