Human AI Evaluation Specialist
Human AI Evaluation Specialist
Role Focus: Human-in-the-loop Validation · Model Output Quality
Role Summary
Human AI Evaluation Specialists will perform structured human evaluation of Gemini AI outputs by labeling, rating, and providing qualitative feedback on model relevance, correctness, safety, and enterprise usability.
Key Responsibilities
Perform manual evaluation of AI-generated responses
Label datasets for training and validation
Apply quality rubrics and evaluation guidelines
Identify hallucinations, bias, and factual errors
Provide structured feedback for model improvements
Support benchmark creation and golden dataset validation
Required Skills & Experience
2+ years in content review, data labeling, QA, or AI evaluation
Strong analytical and critical thinking skills
Excellent written communication skills
Familiarity with AI evaluation frameworks is preferred
High attention to detail