aitrainer.work - AI Training Jobs Platform
Generalist alignerr

English Audio Transcription Specialist

Alignerr β€’ Africa, South Africa, USA

Education

Any

Type

Pay Rate

$12.5/task

Listed

Today

βœ… Applying through this link gives you a verified candidate referral.

Referrals from verified candidates give your profile a visibility boost and help support our platform at no cost to you.

This position is hosted on an external talent platform. Please only apply for this position if it fits your skills and interests.

Apply Now β†’

About this Role

What You'll Do

  • Listen to audio recordings in English and produce precise, verbatim transcriptions
  • Accurately capture spoken content including natural speech patterns, filler words, and speaker changes where required
  • Apply consistent formatting, punctuation, and spelling conventions according to project guidelines
  • Identify and flag unclear, inaudible, or ambiguous audio segments
  • Handle a diverse range of audio types β€” conversations, interviews, lectures, podcasts, customer interactions, and more
  • Review and proofread transcriptions for accuracy, completeness, and adherence to style guides
  • Work independently and asynchronously β€” fully on your own schedule

About the Role

What if your sharp ear and command of English could directly shape how AI listens to and understands the spoken word? We're looking for English Audio Transcription Specialists in Johannesburg to convert spoken audio into accurate, well-formatted text β€” helping AI systems learn to process real human speech with all its accents, nuances, and complexity. South Africa's strong English-speaking workforce and exposure to a rich diversity of accents and speaking styles make Johannesburg an excellent talent hub for this role. This is a fully remote, flexible contract role. No prior transcription or AI experience is required β€” just excellent English listening skills, strong attention to detail, and a reliable internet connection.

  • Organization: Alignerr
  • Type: Hourly Contract
  • Location: Remote
  • Commitment: 10–40 hours/week

Who You Are

  • Native or near-native fluency in English with excellent listening comprehension
  • Strong written English skills β€” confident with grammar, spelling, and punctuation
  • Naturally detail-oriented with a consistent, methodical approach to work
  • Able to follow structured guidelines and apply them accurately at scale
  • Comfortable working with different accents, speaking speeds, and audio quality levels
  • Self-motivated and reliable when working independently
  • No prior transcription, AI, or tech experience required

Nice to Have

  • Experience in transcription, closed captioning, subtitling, or stenography
  • Familiarity with transcription tools or audio editing software
  • Background in linguistics, journalism, communications, or media production
  • Experience working with varied English dialects and accents (American, British, Australian, South African, Indian, etc.)
  • Fast, accurate typing skills (70+ WPM)
  • Knowledge of specialized vocabulary in fields such as medicine, law, technology, or finance

Why Join Us

  • Work on cutting-edge AI projects alongside leading research labs
  • Fully remote and flexible β€” work when and where it suits you
  • Freelance autonomy with the structure of meaningful, task-based work
  • Contribute to AI development that directly improves how technology understands human speech
  • Build skills in a high-demand field at the intersection of language and AI
  • Potential for ongoing work and contract extension as new projects launch

Requirements

  • Must be eligible to work in one of: Africa, South Africa, USA
  • Fluent proficiency in English (Written & Verbal)
  • Reliable high-speed internet connection

Eligible Languages

Fluent proficiency in English

English

Compensation Analysis

What if your sharp ear and command of English could directly shape how AI listens to and understands the spoken word? We're looking for English Audio Transcription Specialists in Johannesburg to convert spoken audio into accurate, well-formatted text β€” helping AI systems learn to process real human speech with all its accents, nuances, and complexi

Skills & Categories

Explore other opportunities in related specializations:

Related Jobs

Alignerr

Browse All Jobs from Alignerr

Discover more opportunities on Alignerr that match your skills and interests.

View All Alignerr Jobs β†’

Community Reviews

Loading reviews…
πŸ’¬

Share your experience with Alignerr

Help other candidates make better decisions by leaving a review.

Sign in to leave a review

Frequently Asked Questions

What is the assessment actually like?

Notoriously strict. Alignerr uses TestGorilla for role-specific timed tests β€” a blank coding environment for engineers, rigorous grammar and fact-checking for writers. There is almost no hand-holding. The critical catch: this is essentially a one-shot process. Fail or abandon the assessment, and you are typically locked out of that role permanently with no option to retake.

How quickly can I start earning after I pass?

Not immediately. Even after passing the assessment and completing identity verification (via Persona) and billing setup (via Deel), you may sit in a waiting pool for weeks or months. You only start earning when a project matching your specific skills launches and you are officially assigned. Do not plan around Alignerr income until you are actively on a project.

Is there a community?

Yes β€” and it is one of Alignerr's genuine strengths. Once assigned to a project, you are added to Slack channels where you can ask questions, get rubric clarifications from admins, and talk to other AI trainers. This is rare in AI training and makes a real difference when guidelines are ambiguous or change mid-project.

What equipment do I need?

For voice or audio roles at this pay level, you typically need a professional home studio setup (XLR microphone, treated room). Phone recordings or laptop mics are usually rejected by quality control.

How is my work used?

You are providing high-quality "ground truth" data. For writers, this means creative generation. For voice actors, it often means training Text-to-Speech models. Be sure to check the specific contract details regarding rights usage for your voice or likeness.

Is creative freedom allowed?

Yes and no. While you are hired for your talent, you must often follow strict style guides (e.g., "Speak in a neutral tone" or "Write in the style of a technical manual"). The goal is consistency for the dataset.

What is the barrier to entry?

Alignerr is known for difficult technical assessments. You must pass a timed test in your specific domain (e.g., Python, Physics, or Language) before you are eligible for any paid projects.