About the job
We invite you to submit your CV in English and detail your English proficiency level.
Mindrift connects talented specialists with project-based opportunities in AI for leading tech companies, emphasizing the testing, evaluation, and enhancement of AI systems. This role is project-based rather than permanent employment.
About the Role
This project calls for an experienced Senior Python Developer with extensive functional testing expertise, robust skills in Linux and Docker, and the capability to interpret code across various programming languages with the aid of LLMs (such as C, Rust, Go). You will be responsible for translating requirements related to migration tasks and utilizing tools like Roo Code or Claude Code to facilitate expedited development processes.
Key Responsibilities
- Design and implement functional black box tests for expansive codebases across diverse source languages.
- Establish and manage Docker environments to guarantee 100% reproducibility of builds and test execution across multiple platforms.
- Oversee code coverage and set up automated scoring criteria to align with industry benchmark standards.
- Utilize LLMs (such as Roo Code and Claude) to streamline development cycles, automate repetitive tasks, and enhance overall code quality.
Requirements
- A minimum of 5 years of experience as a Software Engineer, primarily focusing on Python.
- In-depth knowledge of pytest (including fixtures, session-scoped, and timeouts) and the design of black-box functional tests for CLI tools.
- Exceptional Docker skills (including reproducible Dockerfiles, user contexts, and secure workspaces).
- Proficient in Linux & Bash scripting and comfortable with debugging inside containers.
- Familiarity with modern Python tooling (such as uv, pyproject.toml, packaging).
- Ability to read and comprehend multiple coding languages with LLM support (e.g., C, C++, Rust, or Go).
- Experience leveraging LLMs (Claude Code, Roo Code, Cursor) to expedite iterative development and test-case generation.
- Fluency in English - B2 level or higher.
Preferred Qualifications
- Previous experience with agent evaluation platforms and MCP CLI.
Tools and Technologies: Python (pytest, uv, Pillow), Docker, Bash, Git Submodules, C/C++/Rust/Go (reading), Dagger, GitHub Codespaces, LLMs (Claude Code, Roo Code, Cursor), coverage.py, gcov, kcov.
What We Offer:
- Freelance project-based collaboration through the Mindrift platform (powered by Toloka AI).
- Fully remote and flexible participation — select your preferred working hours (20-30 hours per week).
- Compensation based on tasks, reaching up to $50/hour* contingent on performance and workload.
- A chance to contribute to groundbreaking AI projects for top tech companies.

