About the job
LILT is building an extensive global network of domain experts who deliver high-quality AI evaluations across training, benchmarking, red-teaming, and ongoing model monitoring. We are seeking talented software engineering and DevOps professionals to apply their expert judgment to the human-in-the-loop AI evaluation workflows used by leading enterprises and hyperscalers.
This position is designed for individuals with a deep understanding of software systems, infrastructure, and development practices in real-world production environments. Your expertise will play a crucial role in evaluating and improving multilingual AI systems.
Your contributions will directly impact the quality, safety, and deployment readiness of multilingual AI models.
This position offers two distinct expert tracks, differentiated by experience level and scope of responsibility.
Track A: Software Engineering & DevOps AI Rater
Raters will carry out structured evaluation tasks following clearly defined rubrics and instructions.
Responsibilities
- Evaluate AI outputs related to software engineering, DevOps, and infrastructure topics.
- Conduct structured scoring, comparison, classification, and judgment tasks.
- Assess technical correctness, completeness, security implications, and adherence to best practices.
- Identify hallucinations, incorrect code, unsafe recommendations, or misleading system guidance.
- Consistently apply domain-specific engineering and DevOps guidelines across tasks.
Ideal Background
- Software engineers, site reliability engineers, DevOps engineers, or platform engineers.
- Experience with production systems, CI/CD pipelines, cloud infrastructure, or distributed systems.
- Exceptional attention to detail and comfort working with structured evaluation criteria.
Track B: Software Engineering & DevOps AI Evaluator (Senior Track)
Evaluators provide advanced technical oversight and help shape evaluation processes.
Responsibilities
- Validate and refine evaluation rubrics and edge-case handling.
- Adjudicate disagreements among raters.
- Conduct error analysis and qualitative assessments of model behavior.

