Software Engineering alignerr

Robotics ML Expert — MuJoCo & Reinforcement Learning

Alignerr • Remote

Education

Any

Type

hourly

Pay Rate (by country)

$100–$150/hr

Listed

96d ago

✅ Applying through this link supports our platform at no cost to you.

This position is hosted on an external talent platform. Please only apply for this position if it fits your skills and interests.

Apply Now →

What We Know About This Role

Weekly hours: 10–40 hrs/week

About this Role

From the Alignerr listing

What You'll Do

Design, develop, and iterate on MuJoCo simulation environments for robotics research and AI training
Implement and tune reinforcement learning algorithms (PPO, SAC, TD3, etc.) to train agents in simulated tasks
Define reward functions, observation spaces, and action spaces that produce robust, transferable policies
Debug and optimize physics simulations — contact models, actuator dynamics, and scene configurations
Evaluate trained policies for stability, generalization, and sim-to-real transfer potential
Document environment specifications, training procedures, and experimental results clearly and thoroughly
Collaborate asynchronously with research teams to align simulation work with broader project goals
Stay current with the latest advances in robot learning, simulation, and embodied AI

About the Role

What if your expertise in robotics and machine learning could directly shape how the next generation of intelligent agents learn to move, manipulate, and interact with the physical world? We're looking for Robotics ML Experts in Munich's world-class robotics community with hands-on MuJoCo experience to design, build, and refine simulation environments that train AI systems to perform real-world tasks — from locomotion and dexterous manipulation to complex multi-agent coordination. This is a fully remote, flexible contract role for experienced practitioners who live and breathe physics simulation, reinforcement learning, and robot control. If you've spent time wrangling MJCF files, tuning reward functions, and debugging contact dynamics, this role was made for you.

Organization: Alignerr
Type: Hourly Contract
Location: Remote
Commitment: 10–40 hours/week

Who You Are

Strong hands-on experience with MuJoCo (or MuJoCo via dm_control, Gymnasium/Gymnasium-Robotics, or similar wrappers)
Solid understanding of reinforcement learning theory and practical training pipelines
Proficient in Python and comfortable with ML frameworks such as PyTorch or JAX
Experienced in defining and shaping reward functions for complex robotic tasks
Familiar with robot kinematics, dynamics, and control fundamentals
Able to read and write MJCF/XML model files and understand their physics implications
Self-directed, detail-oriented, and comfortable working independently in an async environment
Strong written communicator who can document technical work clearly

Nice to Have

Experience with sim-to-real transfer techniques (domain randomization, system identification)
Familiarity with other physics simulators — Isaac Gym, PyBullet, Drake, or Genesis
Background in multi-agent environments or hierarchical RL
Published research or open-source contributions in robotics, RL, or embodied AI
Experience with imitation learning, model-based RL, or world models
Graduate-level coursework or degree in robotics, ML, computer science, or a related field

Why Join Us

Work on cutting-edge robotics and AI simulation projects alongside leading research labs
Fully remote and flexible — work when and where it suits you
Freelance autonomy with the structure of meaningful, milestone-driven work
Directly influence how AI agents learn to interact with the physical world
Engage with a global community of top-tier ML and robotics practitioners
Potential for ongoing work and contract extension as new projects launch

Requirements

Fluent proficiency in English (Written & Verbal)
Reliable high-speed internet connection
Bachelor's degree or equivalent professional experience
Demonstrated expertise in Software Engineering

Why This Role

Skills & Categories

Explore other opportunities in related specializations:

Software Engineering Expert

Related Jobs

Engineering & Data tools Specialist

micro1 • Software Engineering

$80 /hr

QA / Software Engineering Reviewer – Browser Test Validation

mercor • Software Engineering

$60 /hr

Head of AI & Engineering Expert

ethos • Software Engineering

$150 /hr

Senior Machine Learning Engineer / Model Evaluations Expert

ethos • Software Engineering

$125 /hr

Browse All Jobs from Alignerr

Discover more opportunities on Alignerr that match your skills and interests.

View All Alignerr Jobs →

Verified Reviews

Loading reviews…

Community Reviews

Loading reviews…

💬

Share your experience with Alignerr

Help other candidates make better decisions by leaving a review.

Frequently Asked Questions

How hard is the Alignerr assessment?

Hard, and unforgiving. Alignerr uses TestGorilla for timed, role-specific tests: a blank coding environment for engineers, strict grammar and fact-checking for writers. Treat it as one shot. Failing or abandoning it typically locks you out of that role permanently, with no retake.

How soon can I start earning on Alignerr after passing the assessment?

Not right away. After passing, you still complete identity verification through Persona and billing setup through Deel, then wait in a pool for weeks or months. You only start earning once a project matching your specific skills launches and assigns you. Don't count on Alignerr income until you're actively placed on a project.

Does Alignerr have a trainer community?

Yes, and it's a genuine strength. Once you're assigned to a project, you join Slack channels where you can get rubric clarifications from admins and talk to other trainers. That kind of support is rare in AI training and matters most when guidelines are ambiguous or shift mid-project.

Is AI training work the same as traditional consulting?

No. Instead of client deliverables, you're given complex scenarios to evaluate: grading the AI's logic, correcting its hallucinations, and supplying expert-level reasoning it doesn't have on its own. The job is closer to teaching than consulting.

Why do these AI training roles pay so much?

Because general knowledge isn't what's being tested. The model already knows the basics; what it needs is expertise on edge cases, the rare, difficult, highly technical judgment calls only a senior professional in the field would make correctly.

What does the day-to-day workload look like for elite-expert AI training roles?

Slow and deep, not fast and repetitive. A single task can take 45-60 minutes of researching citations or verifying complex calculations. Quality is what's being measured here, not throughput.

What does Software Engineering work look like for a Robotics ML Expert — MuJoCo & Reinforcement Learning?

Tasks here are scoped to Software Engineering, not generic labeling. As a Robotics ML Expert — MuJoCo & Reinforcement Learning, expect to draw on real domain judgment (evaluating outputs, correcting errors, or providing expert reasoning specific to Software Engineering) rather than following a one-size-fits-all rubric. If you don't have hands-on Software Engineering background, this is likely not the right listing to start with.

How many hours per week does this role require?

Based on the listing, this role is scoped at 10–40 hours per week. Treat this as a real commitment expectation, not a loose estimate.

What happens when I click Apply on this listing?

You'll be taken to Alignerr's external site to complete your application there. This listing links through a referral, but the process is identical to applying directly; the link just routes you correctly. Create an account on their site and follow their onboarding steps.

What is the barrier to entry for Alignerr?

A difficult, timed technical assessment in your specific domain, like Python, physics, or language. Passing it is required before you're eligible for any paid projects.

Robotics ML Expert — MuJoCo & Reinforcement Learning

What We Know About This Role

About this Role

What You'll Do

About the Role

Who You Are

Nice to Have

Why Join Us

Requirements

Why This Role

Skills & Categories

Related Jobs

Engineering & Data tools Specialist

QA / Software Engineering Reviewer – Browser Test Validation

Head of AI & Engineering Expert

Senior Machine Learning Engineer / Model Evaluations Expert

Browse All Jobs from Alignerr

Verified Reviews

Community Reviews

Leave your review

Frequently Asked Questions

$150–$225/hr. Lawyers, MDs and Finance Experts Wanted.

Get Paid for the Expertise You Already Have

Turn Your Expertise Into $78/hr, On Average

AI Trainer? Don't Let the IRS Keep Your Bonus

Fight AI with AI

No Projects Available?

Fight AI with AI

Fight AI with AI

No Projects Available?

Fight AI with AI

AI Trainer? Don't Let the IRS Keep Your Bonus

Fight AI with AI

Turn Your Expertise Into $78/hr, On Average