About the job
Please submit your CV in English and indicate your level of English proficiency.
Mindrift specializes in connecting talented professionals with project-based opportunities in artificial intelligence for top-tier technology companies, emphasizing the testing, evaluation, and enhancement of AI systems. This is a project-driven role rather than permanent employment.
About the Role
We are seeking a highly skilled Senior Python Developer with extensive experience in functional testing. The ideal candidate will possess advanced Linux and Docker capabilities, alongside the proficiency to interpret code in various programming languages with the assistance of Large Language Models (LLMs) such as C, Rust, or Go. Additionally, the successful candidate will be adept at utilizing tools like Roo Code or Claude Code to streamline the iterative development process.
Key Responsibilities
- Develop and implement functional black box tests for extensive codebases across multiple source languages.
- Establish and manage Docker environments to guarantee fully reproducible builds and testing across diverse platforms.
- Assess code coverage and configure automated scoring criteria to align with industry benchmark standards.
- Utilize LLMs (Roo Code, Claude) to enhance development cycles, automate repetitive tasks, and elevate overall code quality.
Requirements
- Minimum of 5 years of experience as a Software Engineer, with a strong focus on Python.
- In-depth knowledge of pytest (including fixtures, session-scoped, timeouts) and experience in designing black-box functional tests for command-line interface tools.
- Expertise in Docker (including reproducible Dockerfiles, user contexts, and secure workspaces).
- Strong skills in Linux and Bash scripting, along with the ability to debug within containers.
- Familiarity with modern Python tooling (such as uv, pyproject.toml, and packaging).
- Ability to read and comprehend various coding languages (e.g., C, C++, Rust, or Go) with the support of LLMs.
- Experience leveraging LLMs (Claude Code, Roo Code, Cursor) to expedite iterative development and automate test case generation.
- Proficient in English at B2 level or higher.
Preferred Qualifications
- Prior experience with agent evaluation platforms and MCP CLI.
Tools and Technologies: Python (pytest, uv, Pillow), Docker, Bash, Git Submodules, C/C++/Rust/Go (reading), Dagger, GitHub Codespaces, LLMs (Claude Code, Roo Code, Cursor), coverage.py, gcov, kcov.
Benefits
What we offer:
- Freelance project-based collaboration through the Mindrift platform (powered by Toloka AI).
- Fully remote and flexible participation — select your hours and contribution level (20-30 hours weekly).
- Task-based compensation, earning up to $17/hour* depending on performance and workload.

