Bilingual Spanish Generalist Evaluator Expert
Mercor • Remote • Posted 26 days ago
Education: Any
Pay Rate: $24.5/task
About this Role
Mercor is seeking native Spanish speakers from the United States, Spain, Chile, or Mexico with exceptional writing skills to contribute to a high-impact AI research project with a leading lab. Freelancers will author Spanish/English prompt–golden answer pairs that train and evaluate advanced language models.
This role is strictly limited to candidates who are native to the United States, Spain, Chile, or Mexico and have lived in or spent significant time in the country, with deep familiarity with local language usage, tone, and cultural context (including distinctions such as U.S. Spanish, Peninsular Spanish, Chilean Spanish, and Mexican Spanish conventions).
Location Requirements
Native Spanish speakers from the United States, Spain, Chile, or Mexico
Job Details
This is a short-term, flexible opportunity for professionals who combine language mastery, strong critical thinking, and a knack for instructional clarity. It is ideal for those who enjoy distilling complex concepts into well-crafted, culturally grounded Spanish text while maintaining technical precision in English.
Multilingual Prompt Design & Optimization
Create detailed prompts in Spanish and/or English with multiple constraints and instructions, ensuring natural phrasing and real-world relevance for Spanish-speaking users in the United States, Spain, Chile, and Mexico.
Define and Document Evaluation Standards
Establish high-level expectations for correct responses in United States, Spain, Chile, and Mexico consumer contexts, and develop comprehensive rubrics that account for linguistic nuance, tone, and cultural conventions specific to these regions.
Model Testing and Grading (Bilingual)
Run prompts through models and assess preliminary outputs for accuracy, fluency, and cultural fit in Spanish, comparing results against English where needed.
Benchmarking & Quality Assurance
Collaborate in QA review processes to ensure prompt tasks and rubrics meet the required rigor, maintaining consistency and reliability across Spanish-language benchmarks before integration into official evaluations.
Minimum Qualifications
- Native-level fluency in Spanish (written), specific to United States, Spain, Chile, or Mexico usage, with strong reading/writing ability in English.
- Must be native to the United States, Spain, Chile, or Mexico and have lived in or spent significant time in-country, with deep cultural and linguistic familiarity.
- BS or BA from a reputable institution (completed or in progress).
- Strong writing and critical thinking skills.
- Ability to work independently and meet deadlines.
- Significant familiarity with ChatGPT or similar tools for personal decision-making, hobbies, or general interests.
Preferred Qualifications
- Based in the United States, Spain, Chile, or Mexico (or able to reliably produce country-specific, culturally accurate Spanish aligned with one of these regions).
- Experience in teaching, research, editing, or academic writing.
- Experience creating evaluation criteria, rubrics, or grading guidelines.
- Familiarity with LLMs, prompting, or model evaluation (helpful but not required).
More Details About This Role
- Complete an AI-led interview (about 15 minutes).
- If approved, complete a paid assessment focused on writing and rubric creation.
- Then, if selected, you will be invited to work on the project.
- Expect to contribute at least 20 hours per week.
- Expect a commitment of approximately 2–4 months.
- You’ll be working in a structured project environment with clear goals and tools.
We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.
Requirements
- Must be eligible to work remotely
- Fluent proficiency in English (Written & Verbal)
- Reliable high-speed internet connection
Eligible Languages
Fluent proficiency in English or Spanish
Compensation Analysis
Rare opportunity for top 1% experts. Earn $24.5/hr contributing to the world's most advanced AI labs. This is one of the few roles where academic precision is valued as highly as commercial output.
Frequently Asked Questions
Is this for freelancers or full-time employees?
Both. Mercor tries to match you with clients who want long-term contractors. Unlike other platforms where you log in and grab small tasks, Mercor matches you with one company for a steady role (e.g., 'Python Tutor for 3 months').
I'm not comfortable on camera. Can I still apply?
No. The application requires a video interview with an AI avatar. The AI asks you questions about your resume, and the video is shared with potential clients to prove your communication skills.
Does it cost money to join?
No. You should never pay to join these platforms. Mercor makes money by charging the client a fee on top of your hourly rate.
What does the work actually look like?
It is practical, hands-on data work. You might be recording short videos, categorizing images, rating text responses, or analyzing data. The tasks are designed to be short and distinct—typically 5-60 minutes per task.
How flexible is the schedule?
Extremely. This is true "log in and work" flexibility. You can usually work for 20 minutes or 4 hours depending on your availability. There are rarely minimum hour requirements, making it ideal for side income.
Is there an interview?
Usually, no. Hiring for these roles is almost entirely based on passing an automated assessment or "qualification" task. If you pass the test, you get access to the work.
How soon will I start?
Important: Mercor is a talent marketplace, not a task queue. Applying puts you in a pool of candidates. You will only start working when a specific client (like a major AI lab) selects your profile. This matching process can take weeks.