About the job
We invite you to submit your CV in English, including your English proficiency level.
At Mindrift, we bridge the gap between skilled professionals and project-based AI opportunities with top tech companies. Our focus lies in testing, evaluating, and enhancing AI systems. Please note that this is a project-based engagement, not a permanent position.
Position Overview
We are seeking an experienced Senior Python Engineer with substantial expertise in functional testing. The ideal candidate will have a strong command of Linux and Docker, the capability to read and understand code in multiple programming languages (such as C, Rust, and Go) with the aid of Large Language Models (LLMs), and the confidence to utilize tools like Roo Code or Claude Code to streamline iterative development processes.
Key Responsibilities
- Develop functional black box tests for extensive codebases across various programming languages.
- Set up and manage Docker environments to guarantee 100% reproducible builds and test executions across diverse platforms.
- Oversee code coverage and establish automated scoring criteria to align with industry-standard benchmarks.
- Employ LLMs (Roo Code, Claude) to enhance development cycles, automate routine tasks, and elevate overall code quality.
Qualifications
- Minimum of 5 years of experience as a Software Engineer, particularly with Python.
- Extensive experience with pytest (including fixtures, session-scoped tests, and timeouts) and the design of black-box functional tests for command line interface tools.
- Advanced Docker expertise (creating reproducible Dockerfiles, managing user contexts, and ensuring secure workspaces).
- Solid skills in Linux and Bash scripting, with the ability to debug within containers.
- Proficient in contemporary Python tooling (uv, pyproject.toml, packaging).
- Capability to comprehend and interact with various coding languages using LLMs (such as C, C++, Rust, or Go).
- Experience utilizing LLMs (Claude Code, Roo Code, Cursor) for accelerating iterative development and generating test cases.
- English proficiency of B2 level or higher.
Preferred Qualifications
- Prior experience with agent evaluation platforms and MCP CLI.
Tools and Technologies: Python (pytest, uv, Pillow), Docker, Bash, Git Submodules, C/C++/Rust/Go (reading), Dagger, GitHub Codespaces, LLMs (Claude Code, Roo Code, Cursor), coverage.py, gcov, kcov.
What We Offer
- Freelance project-based collaboration through the Mindrift platform (powered by Toloka AI).
- Completely remote and flexible participation — you can decide when and how much to contribute (20-30 hours weekly).
- Compensation based on tasks, up to $80/hour* depending on performance and workload.
- Opportunity to work on groundbreaking AI projects for leading tech companies.

