About the job
Please submit your CV in English and indicate your level of English proficiency.
At Mindrift, we specialize in connecting top-tier professionals with project-based artificial intelligence opportunities for renowned technology firms, emphasizing the evaluation, testing, and enhancement of AI systems. This is a project-based engagement rather than a permanent position.
Role Overview
We are seeking a highly skilled Senior Python Developer with extensive experience in functional testing. The ideal candidate will possess strong expertise in Linux and Docker, be capable of reading and understanding code in multiple languages (including C, Rust, and Go) with the assistance of LLMs, and adept at translating requirements for migration tasks. Proficiency in utilizing tools such as Roo Code or Claude Code to streamline iterative development is essential.
Key Responsibilities
- Develop functional black-box tests for extensive codebases across various programming languages.
- Establish and manage Docker environments to guarantee fully reproducible builds and test executions across diverse platforms.
- Oversee code coverage and configure automated scoring criteria to align with industry benchmarking standards.
- Utilize LLMs (Roo Code, Claude) to expedite development cycles, automate repetitive tasks, and elevate overall code quality.
Requirements
- 5+ years of professional experience as a Software Engineer, primarily in Python.
- Strong expertise in pytest (including fixtures, session-scoped, and timeouts) and designing black-box functional tests for CLI tools.
- Expert-level proficiency in Docker (including reproducible Dockerfiles and secure workspaces).
- Comprehensive knowledge of Linux and Bash scripting, with the ability to debug within containers.
- Familiarity with modern Python tooling (such as uv, pyproject.toml, and packaging).
- Capability to read and comprehend various coding languages (e.g., C, C++, Rust, Go) with assistance from LLMs.
- Experience leveraging LLMs (Claude Code, Roo Code, Cursor) to enhance iterative development and generate test cases.
- English proficiency at B2 level or higher.
Preferred Qualifications
- Prior experience with agent evaluation platforms and MCP CLI.
Tools and Technologies: Python (pytest, uv, Pillow), Docker, Bash, Git Submodules, C/C++/Rust/Go (reading), Dagger, GitHub Codespaces, LLMs (Claude Code, Roo Code, Cursor), coverage.py, gcov, kcov.
Benefits
What We Offer
- Flexible freelance project-based collaboration through the Mindrift platform (powered by Toloka AI).
- Completely remote and flexible participation — choose when and how much to contribute (20-30 hours per week).
- Compensation based on tasks, potentially reaching up to $12/hour* depending on performance and workload.
- Opportunity to contribute to cutting-edge AI projects for leading technology companies.

