About the job
Lilt, Inc. is at the forefront of developing a global network of domain experts dedicated to ensuring high-quality AI evaluation across various domains, including training, benchmarking, red-teaming, and continuous model monitoring. We invite healthcare and life sciences professionals to lend their expert judgment to our human-in-the-loop AI evaluation workflows utilized by leading enterprises and hyperscalers.
This role is tailored for those who possess a deep understanding of the application of medical, clinical, scientific, and life sciences information in real-world healthcare and research settings. Your expertise will play a critical role in evaluating, assessing, and enhancing multilingual AI systems.
Your valuable contributions will directly impact the quality, safety, and deployment readiness of multilingual AI models.
This position offers two distinct expert tracks based on experience level and scope of responsibility.
Track A: AI Rater in Healthcare & Life Sciences
As a Rater, you will undertake structured evaluation tasks, guided by well-defined rubrics and instructions.
Key Responsibilities
Evaluate AI outputs concerning healthcare, medical, and life sciences content.
Perform structured scoring, comparison, classification, and judgment tasks.
Assess clinical accuracy, scientific validity, completeness, and potential safety risks.
Identify inaccuracies, misleading medical guidance, unsupported claims, or unsafe recommendations.
Consistently apply domain-specific healthcare and life sciences guidelines across all tasks.
Ideal Candidate Profile
Healthcare professionals, clinical practitioners, life sciences researchers, or biomedical specialists.
Experience interpreting medical literature, clinical guidelines, scientific research, or health data.
Strong attention to detail and proficiency in working with structured evaluation criteria.
Track B: Senior AI Evaluator in Healthcare & Life Sciences
In the Evaluator track, you will provide higher-level oversight and contribute to shaping the evaluation process.
Key Responsibilities
Validate and refine evaluation rubrics and manage edge-case handling.
Conduct adjudication in instances of disagreement among raters.
Perform error analysis and qualitative reviews of model outputs.

