About the job
Please submit your CV in English and specify your English proficiency level.
Mindrift connects skilled professionals with project-based AI roles at leading tech companies. The platform focuses on testing, evaluating, and improving AI systems. All positions are temporary and project-based, not permanent employment.
Role overview
This Senior Python Systems Developer role centers on functional testing for a large-scale project. The work involves building and executing black box tests for codebases written in various languages, using strong Linux and Docker skills. The position calls for reading and understanding code in C, Rust, or Go, often with the support of large language models (LLMs) such as Roo Code or Claude Code. Translating migration requirements and streamlining development with modern tools is a key part of the job.
What you will do
- Create and run functional black box tests for large, multi-language codebases.
- Set up and manage Docker environments to ensure reproducible builds and reliable test execution across platforms.
- Monitor code coverage and implement automated scoring to meet industry standards.
- Use LLMs (such as Roo Code and Claude) to automate repetitive tasks, speed up development, and improve code quality.
Requirements
- Minimum 5 years of software engineering experience, with a strong focus on Python.
- Advanced knowledge of pytest (including fixtures, session-scoped tests, and timeouts) and experience designing black-box functional tests for CLI tools.
- Proficient with Docker: building reproducible Dockerfiles, managing user contexts, and creating secure workspaces.
- Strong command of Linux and Bash scripting, including debugging within containers.
- Comfortable with modern Python tooling (uv, pyproject.toml, packaging).
- Ability to read and understand code in multiple languages (C, C++, Rust, or Go) with LLM support.
- Experience using LLMs (Claude Code, Roo Code, Cursor) to accelerate development and create test cases.
- English proficiency at B2 level or higher.
Preferred qualifications
- Experience with agent evaluation platforms and MCP CLI.
Key tools and technologies
Python (pytest, uv, Pillow), Docker, Bash, Git Submodules, C/C++/Rust/Go (reading only), Dagger, GitHub Codespaces, LLMs (Claude Code, Roo Code, Cursor), coverage.py, gcov, kcov.
Project details and benefits
- Freelance, project-based work through the Mindrift platform (powered by Toloka AI).
- Fully remote and flexible: set your own hours and workload (20-30 hours per week).
- Competitive compensation: AI trainers on this project can earn up to $45 per hour equivalent.
Location: Remote , Manitoba, Canada
