About the job
We kindly request that you submit your CV in English and specify your English proficiency level.
Mindrift seamlessly connects skilled professionals with project-based AI opportunities at leading technology firms, concentrating on testing, evaluating, and enhancing AI systems. Note that participation is project-based and does not represent permanent employment.
About the Role
This project is ideal for a Senior Python Developer specializing in functional testing, with advanced Linux and Docker capabilities. The role requires proficiency in reading code in multiple languages (including C, Rust, and Go) with the assistance of LLMs and translating migration requirements effectively. Confidence in utilizing tools such as Roo Code or Claude Code to facilitate rapid iterative development is essential.
Key Responsibilities
- Develop functional black box tests for extensive codebases across various source languages.
- Establish and manage Docker environments to guarantee fully reproducible builds and test executions on diverse platforms.
- Oversee code coverage and set up automated scoring criteria to achieve industry-standard benchmarks.
- Utilize LLMs (Roo Code, Claude) to expedite development cycles, automate repetitive tasks, and enhance overall code quality.
Requirements
- Minimum of 5 years of experience as a Software Engineer, primarily focused on Python.
- Extensive experience with pytest (including fixtures, session-scoped tests, and timeouts) and designing black-box functional tests for CLI tools.
- Advanced Docker skills, including creating reproducible Dockerfiles and secure workspaces.
- Strong proficiency in Linux and Bash scripting, with the ability to debug within containers.
- Familiarity with modern Python tooling (uv, pyproject.toml, packaging).
- Capability to read and comprehend multiple programming languages with the aid of LLMs (e.g., C, C++, Rust, Go).
- Experience employing LLMs (Claude Code, Roo Code, Cursor) for accelerating development and generating test cases.
- English proficiency at B2 level or higher.
Preferred Qualifications
- Previous experience with agent evaluation platforms and MCP CLI.
Tools and Technologies: Python (pytest, uv, Pillow), Docker, Bash, Git Submodules, C/C++/Rust/Go (reading), Dagger, GitHub Codespaces, LLMs (Claude Code, Roo Code, Cursor), coverage.py, gcov, kcov.
Benefits
What We Offer
- Freelance project-based collaboration through the Mindrift platform (supported by Toloka AI).
- Completely remote and flexible participation — decide when and how much to contribute (20-30 hours per week).
- Compensation for each project varies based on scope and expertise required, with AI trainers earning up to $80 per hour on this project.

