aitrainer.work - AI Training Jobs Platform
Generalist mercor

Video Annotation Expert

Mercor Remote Posted 29 days ago

Education

Any

Type

Pay Rate

$16/task

Posted

29d ago

✅ Applying through this link gives you a verified candidate referral.

Referrals from verified candidates give your profile a visibility boost and help support our platform at no cost to you.

This position is hosted on an external talent platform. Please only apply for this position if it fits your skills and interests.

Apply Now

About this Role

Mercor is partnering with an AI lab on a project that teaches robots to understand and perform everyday household tasks — like putting away dishes, folding clothes, or tidying a room.

We're looking for detail-oriented freelance contributors to watch first-person videos of people completing household activities and break them down into precise, labeled action segments. Your annotations will directly train the next generation of robots to understand what tasks like "pick up a mug" or "walk to the counter" actually look like in the real world.

Location Requirements

Role Overview

What You'll Do

down to a fraction of a second.

What We're Looking For

Strong attention to detail

Good judgment and common sense

Clear, concise writing in English

Comfort with repetitive, focused work

Basic computer proficiency

How to Apply

Eligibility

United States

Canada

Mercor is partnering with an AI lab on a project that teaches robots to understand and perform everyday household tasks — like putting away dishes, folding clothes, or tidying a room. We're looking for detail-oriented freelance contributors to watch first-person videos of people completing household activities and break them down into precise, labeled action segments. Your annotations will directly train the next generation of robots to understand what tasks like "pick up a mug" or "walk to the counter" actually look like in the real world. Watch egocentric (first-person) videos of people performing household tasks like cleaning, organizing, and tidying rooms. Segment each video into individual actions — identifying exactly when each action starts and ends, down to a fraction of a second. Write short, natural language descriptions for each action (e.g., "Pick up the blue towel and place it on the shelf"). Label each segment with the correct action type and which hand(s) are being used. Screen videos for quality issues before annotating. Follow detailed project guidelines to ensure consistent, high-quality training data. Strong attention to detail — you'll be placing timestamps with frame-level precision and catching subtle differences between similar actions. Good judgment and common sense — many decisions require interpreting guidelines and applying them to ambiguous real-world scenarios, not just following a checklist. Clear, concise writing in English — you'll write short action descriptions that need to sound natural and specific. Native or near-native English fluency required. Comfort with repetitive, focused work — a typical video may have 20–100+ action segments, each requiring careful review. Basic computer proficiency — you'll use a browser-based annotation tool with keyboard shortcuts. Comfort learning new software quickly is a plus. No prior annotation experience is required, but experience with video editing, data labeling, transcription, or similar detail-oriented media work is a plus. Participate in a short AI interview (~20 minutes) Complete a screening quiz (~15 to 20 minutes) that tests attention to detail, judgment, and writing skills. Top candidates will be extended offers in a soon as <1 day. You can apply if you’re based in: United States (except residents of California, New York, Connecticut, Washington, or the District of Columbia) We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

  • Watch egocentric (first-person) videos of people performing household tasks like cleaning, organizing, and tidying rooms.
  • Segment each video into individual actions — identifying exactly when each action starts and ends, down to a fraction of a second.
  • Write short, natural language descriptions for each action (e.g., "Pick up the blue towel and place it on the shelf").
  • Label each segment with the correct action type and which hand(s) are being used.
  • Screen videos for quality issues before annotating.
  • Follow detailed project guidelines to ensure consistent, high-quality training data.
  • Strong attention to detail — you'll be placing timestamps with frame-level precision and catching subtle differences between similar actions.
  • Good judgment and common sense — many decisions require interpreting guidelines and applying them to ambiguous real-world scenarios, not just following a checklist.
  • Clear, concise writing in English — you'll write short action descriptions that need to sound natural and specific. Native or near-native English fluency required.
  • Comfort with repetitive, focused work — a typical video may have 20–100+ action segments, each requiring careful review.
  • Basic computer proficiency — you'll use a browser-based annotation tool with keyboard shortcuts. Comfort learning new software quickly is a plus.
  • United States (except residents of California, New York, Connecticut, Washington, or the District of Columbia)
  • Canada

Requirements

  • Must be eligible to work in Remote
  • Fluent proficiency in English (Written & Verbal)
  • Reliable high-speed internet connection

Eligible Languages

Fluent proficiency in English

English

Compensation Analysis

Monetize your niche expertise without the billable hours. At $16/hr, this role offers elite compensation for pure intellectual work—no client management or administrative bloat.

Skills & Categories

Explore other opportunities in related specializations:

Related Jobs

Mercor

Browse All Jobs from Mercor

Discover more opportunities on Mercor that match your skills and interests.

View All Mercor Jobs →

Community Reviews

Loading reviews…

Frequently Asked Questions

Is this for freelancers or full-time employees?

Both. Mercor tries to match you with clients who want long-term contractors. Unlike other platforms where you log in and grab small tasks, Mercor matches you with one company for a steady role (e.g., 'Python Tutor for 3 months').

I'm not comfortable on camera. Can I still apply?

No. The application requires a video interview with an AI avatar. The AI asks you questions about your resume, and the video is shared with potential clients to prove your communication skills.

Does it cost money to join?

No. You should never pay to join these platforms. Mercor makes money by charging the client a fee on top of your hourly rate.

What does the work actually look like?

It is practical, hands-on data work. You might be recording short videos, categorizing images, rating text responses, or analyzing data. The tasks are designed to be short and distinct—typically 5-60 minutes per task.

How flexible is the schedule?

Extremely. This is true "log in and work" flexibility. You can usually work for 20 minutes or 4 hours depending on your availability. There are rarely minimum hour requirements, making it ideal for side income.

Is there an interview?

Usually, no. Hiring for these roles is almost entirely based on passing an automated assessment or "qualification" task. If you pass the test, you get access to the work.

How soon will I start?

Important: Mercor is a talent marketplace, not a task queue. Applying puts you in a pool of candidates. You will only start working when a specific client (like a major AI lab) selects your profile. This matching process can take weeks.