About the job
We invite you to submit your CV in English and detail your level of English proficiency.
At Mindrift, we connect talented specialists with project-based AI opportunities for esteemed tech companies, focusing on the testing, evaluation, and enhancement of AI systems. Please note that this role is project-based, not a permanent employment position.
About the Position
This role is ideal for a Senior Python Engineer with extensive functional testing experience. Candidates should possess strong Linux and Docker skills, the ability to comprehend and translate code across multiple programming languages with the assistance of LLMs (for instance, C, Rust, Go), and confidence in utilizing tools like Roo Code or Claude Code to expedite iterative development.
Key Responsibilities
- Develop functional black box tests for extensive codebases in various source languages.
- Create and manage Docker environments to guarantee 100% reproducibility of builds and test executions across diverse platforms.
- Oversee code coverage and establish automated scoring criteria to meet industry benchmark standards.
- Utilize LLMs (Roo Code, Claude) to enhance development cycles, automate repetitive tasks, and elevate overall code quality.
Qualifications
- A minimum of 5 years of experience as a Software Engineer, primarily focusing on Python.
- In-depth expertise with pytest (including fixtures, session-scoped, timeouts) and crafting black-box functional tests for CLI tools.
- Advanced skills in Docker (creating reproducible Dockerfiles, managing user contexts, ensuring secure workspaces).
- Strong proficiency in Linux and Bash scripting, with a solid comfort level for debugging within containers.
- Familiarity with modern Python tooling (uv, pyproject.toml, packaging).
- Aptitude for reading and understanding various coding languages with the help of LLMs (e.g., C, C++, Rust, or Go).
- Experience utilizing LLMs (Claude Code, Roo Code, Cursor) to accelerate iterative development and generate test cases.
- English proficiency at a B2 level or higher.
Preferred Qualifications
- Previous experience with agent evaluation platforms and MCP CLI.
Tools and Technologies: Python (pytest, uv, Pillow), Docker, Bash, Git Submodules, C/C++/Rust/Go (reading), Dagger, GitHub Codespaces, LLMs (Claude Code, Roo Code, Cursor), coverage.py, gcov, kcov.
What We Offer
- Project-based freelance collaboration through the Mindrift platform (powered by Toloka AI).
- Fully remote and flexible participation, allowing you to choose when and how much to contribute (20-30 hours per week).
- Compensation based on tasks, with rates up to $45/hour* depending on performance and workload.

