Remote Data Annotation Jobs in 2026: How to Get Hired at Appen, Scale AI & More
Data annotation is one of the fastest-growing remote job categories in the world, powered directly by the AI boom. Every large language model — ChatGPT, Claude, Gemini — was trained on human-labeled data, and demand for that work continues to grow. Here's how the market works, what it actually pays, and how to get hired.
Founder, TryApplyNow
What is data annotation?
Data annotation is the process of labeling raw data — images, text, audio, video — so that machine learning models can learn from it. When you see a self-driving car recognize a stop sign, that happened because thousands of annotators drew bounding boxes around stop signs in training images. When ChatGPT gives you a coherent answer, that was shaped in part by human annotators who rated which responses were better and why.
The work ranges from simple to surprisingly complex. At the basic end: classifying whether a photo contains a cat, transcribing an audio clip, or labeling customer reviews as positive or negative. At the advanced end: evaluating whether an AI's mathematical reasoning is correct, writing code that trains a model to be better at coding, or assessing the factual accuracy of long-form AI outputs. The pay difference between these tiers is significant — $12-15/hr for commodity annotation versus $50-80/hr for expert-level RLHF work.
Why demand exploded (and why it will stay high)
The generative AI boom that began with the release of ChatGPT in late 2022 directly created hundreds of thousands of data annotation jobs. Every major AI lab — OpenAI, Anthropic, Google, Meta, Mistral, and dozens of others — needs continuous streams of human-labeled data to train, fine-tune, and align their models.
This demand is not going away. Even as AI models become more capable, they require more sophisticated human feedback to improve, not less. The bottleneck has shifted from "can we get humans to label images?" to "can we find humans with the domain expertise to evaluate complex AI outputs?" That shift is what drives the upper end of the pay range higher over time.
In 2026, the annotation market is estimated to be worth $5-8 billion annually, with strong growth projected through the rest of the decade.
Types of annotation work
Image labeling and segmentation ($15-25/hr)
Image annotation is the most familiar category: drawing bounding boxes around objects, classifying images into categories, or doing pixel-level segmentation where you outline exactly where an object begins and ends. This work is used primarily for computer vision applications — autonomous vehicles, medical imaging, satellite imagery analysis, and retail product recognition.
The work is repetitive but requires reasonable attention to detail. Most platforms have accuracy requirements (typically 90-95% agreement with other annotators) and will remove workers who fall below threshold. Pay is typically project-based, which effectively translates to $15-25/hr for consistent workers.
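Agreement scores like the 90-95% threshold above are usually computed by comparing each worker's labels against a consensus or gold-standard answer key. A minimal sketch of that calculation (the labels, threshold, and function name here are illustrative, not any platform's actual implementation):

```python
# Sketch: how a platform might score an annotator against consensus labels.
# The 90% threshold and the example labels are illustrative.

def agreement_rate(my_labels, consensus_labels):
    """Fraction of items where the annotator matches the consensus answer."""
    matches = sum(mine == theirs for mine, theirs in zip(my_labels, consensus_labels))
    return matches / len(my_labels)

mine      = ["cat", "dog", "cat", "cat", "dog", "cat", "dog", "cat", "cat", "dog"]
consensus = ["cat", "dog", "cat", "dog", "dog", "cat", "dog", "cat", "cat", "dog"]

rate = agreement_rate(mine, consensus)
print(f"Agreement: {rate:.0%}")  # 9 of 10 labels match -> 90%
```

Real platforms often use more robust statistics (such as chance-corrected agreement) and hidden "gold" tasks, but the principle is the same: consistent mismatches against consensus get you removed from a project.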
Text classification and sentiment analysis ($12-20/hr)
Text annotation includes classifying whether a product review is positive, negative, or neutral; labeling whether a social media post violates platform rules; categorizing customer support tickets by topic; and hundreds of similar tasks. This is the most accessible entry point into annotation work — it requires only basic reading comprehension in the relevant language.
Pay is at the lower end of the annotation range because the skill requirements are minimal and the supply of workers is high. However, annotation in languages other than English pays a significant premium ($20-35/hr for rare languages) because the pool of qualified annotators is much smaller.
RLHF — Reinforcement Learning from Human Feedback ($20-80/hr)
RLHF is the high-value tier of the annotation market. In RLHF work, human annotators evaluate AI outputs — rating which of two AI responses is better, flagging factual errors, writing "ideal" responses that the model should have given, or scoring outputs across multiple dimensions (helpfulness, accuracy, safety, style).
The pay range is wide because the skill requirements are wide. Basic RLHF comparison tasks (which response is better?) start at $20-25/hr. Domain-specific RLHF — evaluating AI outputs in medicine, law, finance, or advanced coding — pays $40-80/hr for people with genuine expertise in those fields. A doctor evaluating medical AI outputs or a senior software engineer evaluating code quality can command significantly higher rates than a generalist.
Audio transcription ($15-25/hr)
Audio annotation involves transcribing speech to text, labeling speaker identities (diarization), and classifying audio clips by content type (music, speech, ambient noise). Transcription platforms like Rev and Verbit use primarily human workers, while AI labs use transcription specifically to create training data for speech recognition systems.
Pay varies by accuracy requirement and audio complexity. Clean speech in quiet environments: $15-18/hr equivalent. Medical or legal transcription, accented speech, or poor audio quality: $20-25/hr. Certified medical transcription is a separate, better-paid category in its own right.
Video annotation ($15-30/hr)
Video annotation is similar to image annotation but applied to sequences of frames — tracking objects across time, labeling actions, classifying scene types. Autonomous vehicle training data uses enormous amounts of video annotation. Video annotation is more complex than single-image annotation because consistency across frames is required, and pay reflects that.
Top platforms for data annotation work
1. Appen
Appen is one of the oldest and largest data annotation platforms, with a global network of over 1 million contractors. They have a broad range of projects: search quality evaluation, social media content rating, image annotation, and text classification. Appen contracts directly with major tech companies including Google, Microsoft, and Apple.
In practice, working on Appen is inconsistent. Work is project-based and not guaranteed: you might have plenty of work one month and little the next. Pay is on the lower end of the annotation market, and getting into the better-paying "Qualified Contributor" track takes time and passing assessments.
Appen is a good starting point for people with no annotation experience because the barrier to entry is relatively low. Use it to build a track record, then pursue higher-paying platforms.
2. Scale AI
Scale AI is the premium end of the annotation market. Their taskers (the term Scale uses for annotators) typically earn more than on Appen, and the projects tend to be more sophisticated. Scale works primarily with enterprise clients and AI labs on complex annotation tasks.
Getting into Scale is harder than Appen. There are skills assessments and qualification requirements. Pay depends significantly on project type — basic annotation tasks start at $15-18/hr, while expert-level projects can pay $30-50/hr. Scale's Outlier platform (below) handles the highest-skill work at the highest rates.
3. Remotasks
Remotasks (owned by Scale AI) is explicitly designed as a beginner-friendly entry point. They offer free training modules for different annotation skill types, and you can start earning immediately upon completing training. This makes it the easiest platform to get started on.
Pay is modest ($10-20/hr equivalent) and work volume is variable. But for someone with zero annotation experience, Remotasks provides a structured way to develop skills and build a track record that opens doors to higher-paying platforms.
4. Outlier (by Scale AI)
Outlier is where the high-paying annotation work lives. Projects focus on evaluating and improving AI reasoning — coding tasks, math problem evaluation, scientific accuracy assessment, and complex RLHF work. Pay ranges from $25-50/hr for most projects, with specialized domain work paying more.
The qualification bar is higher: Outlier typically requires domain expertise or a strong quantitative background. Applications go through a screening process including skills tests. But for people who qualify, Outlier is significantly more lucrative than the general annotation market.
5. Prolific and Labelbox
Prolific focuses on academic and research data collection — surveys, behavioral experiments, and social science research. Pay is per study rather than per hour, but Prolific enforces a minimum hourly rate for studies, so participants earn at least a minimum-wage equivalent. Prolific is useful as a supplementary income source rather than primary annotation work.
Labelbox is an enterprise annotation platform used primarily by companies building their own data annotation operations, not a marketplace for individual workers. Labeling jobs via Labelbox typically go through staffing agencies or direct employer relationships.
6. Mercor
Mercor is newer but increasingly significant — an AI-powered talent marketplace that matches contractors with AI training projects. They vet contractors through assessments and connect them with companies running specific annotation or evaluation projects. Pay is typically in the $20-50/hr range. The vetting process is more rigorous, but it results in higher-quality project matches and better pay.
How to get hired at Appen specifically
Since Appen is the most common starting point, here's the specific process:
- Create a profile on Appen's website. Include all languages you speak, your educational background, and any domain expertise (medical, legal, financial, technical).
- Apply to open projects. Projects are listed in your dashboard. Apply to multiple simultaneously — acceptance to any one is not guaranteed.
- Pass the qualification assessments. Most projects require you to complete training material and pass a quality assessment before you can earn. Take these seriously — your accuracy score affects which projects you're invited to.
- Complete "Qualification Tasks." These are paid test tasks that allow Appen to evaluate your work quality before assigning full project work.
- Maintain quality standards. Appen monitors accuracy using "honeypot" tasks — items where the correct answer is known. Workers who fall below quality thresholds are removed from projects.
Appen qualifications are language-specific. If you speak multiple languages, qualifying for annotation work in each language dramatically expands your project options and income potential.
RLHF vs basic annotation: the pay gap explained
The difference between $15/hr annotation work and $50/hr RLHF work is essentially the difference between commodity labor and knowledge work. Basic annotation — drawing boxes, classifying images — can be done by almost anyone with basic literacy and a stable internet connection. The supply of workers is global and massive, which keeps rates low.
High-value RLHF work requires genuine domain expertise. An AI company that needs a doctor to evaluate whether its medical AI is giving safe advice cannot substitute a generalist annotator. The supply of qualified people is small; the demand is high; the pay reflects that.
If you have specialized knowledge — software engineering, data science, medicine, law, finance, academic research — lead with that expertise when applying to annotation platforms. The difference in earnings is not marginal; it can be 3-5x the hourly rate for the same time invested.
How much can you realistically earn?
Honest income expectations for 2026:
- Starting out on Appen or Remotasks: $200-600/month is common while building your track record. Work availability is inconsistent and qualification requirements limit access to projects.
- Established generalist annotator: $1,000-2,500/month working 20-30 hours/week across multiple platforms. This requires being qualified on multiple platforms and being selective about which projects to take.
- Domain expert (RLHF tier): $3,000-8,000/month is realistic for someone with strong domain expertise working consistent hours on high-value projects. Software engineers evaluating AI code quality and medical professionals evaluating health AI outputs are in the highest demand category.
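To see how the hourly rates above translate into the monthly figures, here's a quick back-of-the-envelope calculator. The specific rates and hours are illustrative picks from the ranges in this article, not guarantees:

```python
# Rough monthly-income estimator for the tiers above.
# Rates and weekly hours are illustrative examples, not promises.

def monthly_income(hourly_rate, hours_per_week):
    """Average monthly earnings, spreading 52 weeks across 12 months."""
    return hourly_rate * hours_per_week * 52 / 12

scenarios = {
    "Established generalist ($18/hr, 25 hrs/wk)": (18, 25),
    "RLHF domain expert ($55/hr, 25 hrs/wk)":     (55, 25),
}
for name, (rate, hours) in scenarios.items():
    print(f"{name}: ~${monthly_income(rate, hours):,.0f}/month")
# Generalist lands near $1,950/month; the expert near $5,958/month --
# consistent with the $1,000-2,500 and $3,000-8,000 ranges above.
```

The gap between the two scenarios comes almost entirely from the hourly rate, which is why the next section focuses on how to move up the rate ladder rather than simply working more hours.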
Data annotation is best understood as a flexible income supplement rather than a primary career for most people. The exception is RLHF-level work for domain experts, where the income can genuinely rival traditional employment.
Is data annotation a good long-term career?
The honest answer is: it depends on which tier you're in.
For commodity annotation (image labeling, text classification), the outlook is mixed. Automation is gradually taking over the most repetitive tasks, and the pay has not increased meaningfully. It's a way to earn flexible income today, but it's not a career path with meaningful growth trajectory.
For expert-level RLHF and AI evaluation work, the outlook is significantly better. The demand for domain experts who can evaluate increasingly sophisticated AI outputs is growing, not shrinking. If you can build a reputation as a reliable, high-accuracy annotator in a specialized field, it's possible to build a sustainable income that functions like a permanent part-time consulting arrangement.
Tax and contractor status considerations
All major annotation platforms pay workers as independent contractors, not employees. This means:
- You are responsible for self-employment taxes in the US: roughly 15.3% on top of income tax. The Social Security portion (12.4%) applies only up to the annual wage base, which adjusts each year, while the Medicare portion (2.9%) has no cap.
- Platforms will issue a 1099-NEC form if you earn more than $600 in a calendar year. Keep track of all income across platforms regardless of whether a 1099 is issued.
- Home office expenses, internet costs, and equipment used for annotation work may be deductible as business expenses. Consult a tax professional familiar with freelance and contractor income.
- Many annotation platforms pay via PayPal or bank transfer. Keep records of all payments received.
If annotation income becomes a meaningful part of your earnings, making quarterly estimated tax payments prevents a large tax bill in April and potential underpayment penalties.
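A rough sketch of how a US contractor might estimate those quarterly payments. The 92.35% net-earnings adjustment and the 15.3% SE rate are standard IRS figures; the flat 12% income-tax rate is a placeholder and will differ by bracket, state, and deductions — this is an illustration, not tax advice:

```python
# Sketch of a quarterly estimated-tax calculation for a US contractor.
# PLACEHOLDER_INCOME_RATE is an assumption; your actual bracket will differ.

SE_TAX_RATE = 0.153        # 12.4% Social Security + 2.9% Medicare
SE_INCOME_FACTOR = 0.9235  # SE tax applies to 92.35% of net self-employment earnings
PLACEHOLDER_INCOME_RATE = 0.12  # illustrative flat federal income-tax rate

def quarterly_estimate(net_annual_income):
    """Very rough quarterly estimated payment: SE tax plus placeholder income tax."""
    se_tax = net_annual_income * SE_INCOME_FACTOR * SE_TAX_RATE
    # Half of SE tax is deductible before computing income tax
    income_tax = (net_annual_income - se_tax / 2) * PLACEHOLDER_INCOME_RATE
    return (se_tax + income_tax) / 4

print(f"Quarterly payment on $24,000 net: ${quarterly_estimate(24_000):,.0f}")
```

Even at a modest $24,000/year of annotation income, the quarterly payment runs well over $1,000, which is exactly the kind of April surprise that estimated payments are meant to prevent.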
Stop guessing why you're not getting interviews
TryApplyNow scores your resume against every job, tailors it to each one, and surfaces the hiring manager's email — so you spend your time interviewing, not searching.