About the job
We kindly ask you to submit your CV in English and specify your English proficiency level.
Mindrift serves as a bridge between talented professionals and project-based AI opportunities with top technology companies, concentrating on the testing, evaluation, and enhancement of AI systems. Note that participation is project-based and does not constitute permanent employment.
Role Overview
This opportunity is ideal for a Senior Python Developer with extensive experience in functional testing. The candidate should be proficient in Linux and Docker, capable of reading and interpreting code in various programming languages with the assistance of LLMs (such as C, Rust, Go), and adept at translating requirements for migration tasks. A strong familiarity with tools like Roo Code or Claude Code to boost iterative development is essential.
Key Responsibilities
- Develop functional black box tests for substantial codebases across multiple source languages.
- Establish and maintain Docker environments to guarantee completely reproducible builds and testing across diverse platforms.
- Oversee code coverage and set automated scoring criteria to align with industry benchmark standards.
- Utilize LLMs (like Roo Code and Claude) to expedite development cycles, automate repetitive tasks, and enhance overall code quality.
Qualifications
- 5+ years of professional experience as a Software Engineer, predominantly in Python.
- In-depth knowledge of pytest (including fixtures, session-scoped, timeouts) and designing black-box functional tests for CLI tools.
- Proficient in Docker (creating reproducible Dockerfiles, managing user contexts, ensuring secure workspaces).
- Strong skills in Linux and Bash scripting, with the ability to debug within containers.
- Familiarity with modern Python tools (uv, pyproject.toml, packaging).
- Capability to read and comprehend multiple coding languages with LLM assistance (for instance, C, C++, Rust, or Go).
- Experience leveraging LLMs (Claude Code, Roo Code, Cursor) to enhance iterative development and generate test cases.
- English proficiency of B2 or higher.
Preferred Qualifications
- Experience with agent evaluation platforms and MCP CLI.
Technologies and Tools: Python (pytest, uv, Pillow), Docker, Bash, Git Submodules, C/C++/Rust/Go (reading), Dagger, GitHub Codespaces, LLMs (Claude Code, Roo Code, Cursor), coverage.py, gcov, kcov.
Benefits
What We Offer
- Project-based freelance collaboration via the Mindrift platform (powered by Toloka AI).
- Completely remote and flexible participation — choose your own schedule and commitment (20-30 hours per week).
- Compensation based on tasks, amounting to up to $40/hour* depending on performance and workload.
- An opportunity to engage in groundbreaking AI projects with leading tech companies.

