About the job
We invite you to submit your CV in English along with your English proficiency level.
At Mindrift, we specialize in connecting talented professionals with project-based AI opportunities at leading technology firms. Our focus is on testing, evaluating, and enhancing AI systems. Please note: This role is project-based and does not constitute permanent employment.
Role Overview
We are looking for an experienced Senior Python Developer with a robust background in functional testing. Candidates should have proficiency in Linux and Docker, the capability to read and interpret code in multiple programming languages (such as C, Rust, and Go) with the assistance of Large Language Models (LLMs), and the skill to translate requirements for migration tasks. Familiarity with tools like Roo Code or Claude Code for accelerating iterative development is essential.
Key Responsibilities
- Develop and implement functional black-box tests for extensive codebases across various source languages.
- Create and manage Docker environments to ensure 100% reproducible builds and test executions on diverse platforms.
- Monitor code coverage and establish automated evaluation criteria to adhere to industry benchmark standards.
- Utilize LLMs (like Roo Code, Claude) to streamline development cycles, automate repetitive tasks, and enhance overall code quality.
Qualifications
- Minimum of 5 years of experience as a Software Engineer, with a focus on Python.
- In-depth knowledge of pytest (including fixtures, session-scoped, timeouts) and black-box functional testing for command-line interface tools.
- Expertise in Docker (including reproducible Dockerfiles, user contexts, and secure workspaces).
- Strong Linux and Bash scripting skills, with a comfort level in debugging within containers.
- Familiarity with modern Python tooling (uv, pyproject.toml, packaging).
- Ability to read and comprehend multiple coding languages with LLMs (e.g., C, C++, Rust, Go).
- Experience leveraging LLMs (such as Claude Code, Roo Code, Cursor) to enhance iterative development and test-case generation.
- Fluency in English at a B2 level or higher.
Preferred Qualifications
- Experience with agent evaluation platforms and MCP CLI.
Tools and Technologies: Python (pytest, uv, Pillow), Docker, Bash, Git Submodules, C/C++/Rust/Go (reading), Dagger, GitHub Codespaces, LLMs (Claude Code, Roo Code, Cursor), coverage.py, gcov, kcov.
What We Offer
- Freelance, project-based collaboration through the Mindrift platform (powered by Toloka AI).
- Fully remote and flexible participation—you can choose your working hours and commitment (20-30 hours per week).
- Compensation is task-based, with rates up to $80/hour* based on performance and volume.

