About the job
We invite you to submit your CV in English, along with your level of English proficiency.
At Mindrift, we bridge the gap between talented specialists and innovative AI projects for top tech companies. Our focus is on the testing, evaluation, and enhancement of AI systems. Please note that this collaboration is project-based and does not offer permanent employment.
About the Role
We are seeking a skilled Senior Python Developer with extensive experience in functional testing. The ideal candidate will possess strong expertise in Linux and Docker, an ability to read code across various programming languages (such as C, Rust, Go) with the aid of LLMs, and will be adept at translating requirements for migration tasks. Proficiency in utilizing tools like Roo Code or Claude Code to expedite iterative development is essential.
Key Responsibilities
- Design and implement functional black box tests for extensive codebases in multiple programming languages.
- Create and manage Docker environments to ensure complete reproducibility of builds and test executions across diverse platforms.
- Oversee code coverage and set up automated scoring criteria to comply with industry benchmark standards.
- Utilize LLMs (Roo Code, Claude) to enhance development cycles, automate repetitive tasks, and elevate overall code quality.
Requirements
- Minimum of 5 years experience as a Software Engineer, with a primary focus on Python.
- In-depth knowledge of pytest (fixtures, session-scoped, timeouts) and experience designing black-box functional tests for CLI tools.
- Advanced Docker skills (creating reproducible Dockerfiles, user contexts, secure workspaces).
- Solid Linux & Bash scripting capabilities and familiarity with debugging in containerized environments.
- Proficiency in modern Python tools (uv, pyproject.toml, packaging).
- Ability to read and comprehend multiple programming languages with the assistance of LLMs (e.g., C, C++, Rust, Go).
- Experience in utilizing LLMs (Claude Code, Roo Code, Cursor) for accelerating iterative development and generating test cases.
- English proficiency at B2 level or higher.
Preferred Qualifications
- Experience with agent evaluation platforms and MCP CLI.
Tools and Technologies: Python (pytest, uv, Pillow), Docker, Bash, Git Submodules, C/C++/Rust/Go (reading), Dagger, GitHub Codespaces, LLMs (Claude Code, Roo Code, Cursor), coverage.py, gcov, kcov.
Benefits
What We Offer
- Project-based freelance collaboration through the Mindrift platform (powered by Toloka AI).
- Fully remote and flexible engagement— you can choose your hours (20-30 hours per week).
- Compensation based on tasks completed, potentially up to $24/hour* depending on performance and workload.
- Opportunity to contribute to cutting-edge AI projects for leading tech companies.

