Data Labeler

Remote, USA
Posted Jun 14, 2026
Full-time

About us
White Circle is an AI Safety company building the safety, reliability, and optimization layer for AI systems. At the core of our platform are policies – simple natural-language rules that define what an AI model should and shouldn’t do. We automatically test, enforce, and continuously improve these policies at scale.
We’ve raised $11M from top funds, founders, and senior leaders at OpenAI, Anthropic, HuggingFace, Mistral, DeepMind, Datadog, Sentry, and others

We process over one hundred million API calls every month

We fine-tune and train our own LLMs so they run faster and cheaper than any open or proprietary model

We’re a small, highly focused team. If you want to work deeply on hard problems, see your work ship to production quickly, and influence how AI safety is actually built – you’re the one we need.

In this role, you will
Review and evaluate AI conversations and model outputs

Assess responses for safety, quality, accuracy, policy compliance, and user intent

Identify harmful, unsafe, misleading, or low-quality behavior

Label and categorize model outputs according to internal evaluation frameworks

Moderate sensitive content and identify policy violations

Compare, rank, and score model responses

Investigate edge cases and ambiguous situations

Provide structured feedback to researchers and engineers

Help improve evaluation guidelines and annotation processes

Contribute to the datasets used to train and evaluate AI systems

We're looking for someone who
Has exceptional attention to detail

Can make consistent decisions across large volumes of data

Enjoys analysing nuanced situations where there isn't always a clear answer

Can follow guidelines while exercising good judgment

Has strong written English skills

Communicates clearly and explains reasoning well

Is curious about AI and how these systems work

You might be a great fit if you
Have experience with content moderation, trust & safety, quality assurance, compliance, or policy enforcement

Have experience in data annotation, AI evaluation, RLHF, or model assessment

Have worked with AI tools extensively and understand their strengths and limitations

Enjoy finding edge cases and unusual model behavior

Important note
This role may involve reviewing content that is offensive, harmful, violent, sexual, or otherwise disturbing. We provide tooling and support, but candidates should be comfortable working with sensitive content when necessary.

Why White Circle
Salary of $30,000 to $50,000 + equity

Paid time off in line with your local regulations, no matter where you work from

All the hardware, tools, and services you need

How we hire
Intro call with HR (25 min)

Take-home assignment

Final conversation with our CEO (35 min)

Please submit your application in English.
