AI Agent Testing Specialist at Mindrift | Belo Horizonte, Brazil | RoboApply Jobs

This job posting is no longer active and is not accepting applications.

AI Agent Testing Specialist - Evaluation Scenario Writer

Mindrift | Remote | Belo Horizonte, State of Minas Gerais, Brazil

Remote | Part-time | Up to $17/hr

Qualifications

This role is ideal for experienced developers, software engineers, and/or test automation specialists interested in part-time, non-permanent projects. Preferred qualifications include a degree in Computer Science or Software Engineering, over 5 years of software development experience in Python, a strong background in Full-Stack development, proficiency in writing functional and integration tests, Docker container experience, understanding of CI/CD processes, and B2 level English proficiency.

About the job

Please submit your CV in English and specify your level of English proficiency.

At Mindrift, we specialize in connecting talented professionals with project-based AI opportunities for premier technology companies. Our primary focus is on the testing, evaluation, and enhancement of AI systems. Note: Positions are project-based and do not constitute permanent employment.

Your Role

As an Evaluation Scenario Writer, you will design rigorous coding test cases that challenge AI coding systems to excel:

Evaluate and enhance realistic coding tasks derived from actual production codebases, ensuring they align with practical scopes, requirements, and relevant information sources.
Devise thorough functional tests to validate the AI's end-to-end performance and edge cases, moving beyond superficial assessments.
Create “fair yet challenging” tasks that equip AI with essential context while requiring it to apply complex reasoning across varied information sources.
Investigate AI shortcomings to differentiate between areas of struggle and mastery.
Refine your work based on feedback from expert QA reviewers who assess your contributions against seven key quality metrics.

Ideal Candidate Profile

This role is well-suited for seasoned developers, software engineers, or test automation specialists interested in part-time, non-permanent opportunities. Preferred qualifications include:

A degree in Computer Science, Software Engineering, or a related field.
Over 5 years of software development experience, focusing on Python (including pytest, async/await, subprocess, and file operations).
A strong foundation in Full-Stack development, balancing expertise in both React-based front-end interfaces and robust back-end systems.
Proficiency in writing tests, including functional and integration tests (not merely executing them).
Experience with Docker containers for local evaluations.
Understanding of CI/CD processes, particularly with GitHub Actions (triggers, labels, result interpretation).
English proficiency at a B2 level or higher.

How This Works

Application → Qualification Assessment → Project Assignment → Task Completion → Payment

Estimated Effort

Project tasks are estimated at 20 hours, depending on complexity. This figure is a guideline, not a fixed schedule: you choose when and how to work. To be approved, every task must be submitted by its deadline and meet the acceptance criteria.

Compensation

Contributions are paid at rates of up to $17/hour*.
Fixed project rates or individual rates depending on the project specifics.
Some projects may include incentive payments.

*Note: Compensation rates vary based on expertise, skill assessments, location, project requirements, and other factors. Higher rates may be available for highly specialized professionals. Lower rates may apply during onboarding or initial project phases. Payment details will be provided per project.

About Mindrift

Mindrift connects specialists with project-based AI opportunities for leading tech companies, focusing on testing, evaluating, and improving AI systems.
