About the job
We encourage you to submit your CV in English, along with your self-assessed English proficiency level.
Mindrift, a platform that connects skilled professionals with project-based AI opportunities, is collaborating with leading technology firms to enhance AI systems through rigorous testing and evaluation. This role is project-based and does not imply permanent employment.
Position Overview
We are seeking an experienced Senior Python Engineer adept in functional testing, possessing robust skills in Linux and Docker. The ideal candidate should be proficient in reading and interpreting code across diverse languages with the assistance of LLMs (for example, C, Rust, Go) and should be capable of translating requirements for migration tasks. Familiarity with tools such as Roo Code or Claude Code for fostering iterative development is essential.
Key Responsibilities
- Develop and implement functional black box tests for extensive codebases across various programming languages.
- Set up and manage Docker environments to guarantee 100% reproducibility in builds and testing execution across multiple platforms.
- Track code coverage and establish automated scoring criteria to achieve industry-standard benchmarks.
- Utilize LLMs (Roo Code, Claude) to streamline development cycles, automate repetitive tasks, and enhance overall code quality.
Required Qualifications
- Over 5 years of experience as a Software Engineer with a primary focus on Python.
- Extensive experience with pytest (including fixtures, session-scoped, and timeouts) and the design of black-box functional tests for CLI tools.
- Advanced Docker expertise (creating reproducible Dockerfiles, user contexts, and secure workspaces).
- Strong skills in Linux and Bash scripting, with the ability to debug within containers.
- Familiarity with modern Python tools (uv, pyproject.toml, packaging).
- Capability to read and understand various coding languages (like C, C++, Rust, or Go) with the assistance of LLMs.
- Proven experience using LLMs (Claude Code, Roo Code, Cursor) to enhance iterative development and generate test cases.
- English language proficiency at B2 level or higher.
Preferred Qualifications
- Previous experience with agent evaluation platforms and MCP CLI.
Technologies and Tools: Python (pytest, uv, Pillow), Docker, Bash, Git Submodules, C/C++/Rust/Go (reading), Dagger, GitHub Codespaces, LLMs (Claude Code, Roo Code, Cursor), coverage.py, gcov, kcov.
What We Offer
- Freelance project-based collaboration through the Mindrift platform powered by Toloka AI.
- Completely remote and flexible participation—choose your hours (20-30 hours per week).
- Compensation based on tasks, up to $17/hour* depending on performance and volume.
- Chance to contribute to groundbreaking AI projects with leading tech companies.

