About the job
Please submit your CV in English and describe your English language proficiency.
toloka-ai, working with Mindrift, connects experienced professionals to project-based AI work for leading technology companies. This freelance contract centers on evaluating and refining AI systems; it is not a permanent position.
Project Scope
Each assignment brings a new data science challenge. Typical responsibilities include:
- Designing original data science problems that reflect real analytical workflows in sectors such as telecom, finance, government, e-commerce, and healthcare.
- Building problems that require Python solutions, using libraries such as pandas, NumPy, SciPy, scikit-learn, statsmodels, Matplotlib, and seaborn.
- Ensuring tasks are computationally intensive and cannot be solved by hand in a short time.
- Developing complex scenarios involving data processing, statistical analysis, feature engineering, predictive modeling, and insight generation.
- Creating deterministic problems with reproducible outcomes, either by avoiding randomness or setting fixed seeds.
- Basing problems on real business needs, such as customer analytics, risk assessment, fraud detection, forecasting, optimization, and operational efficiency.
- Covering the entire data science workflow: data ingestion, cleaning, exploratory analysis, modeling, validation, and deployment considerations.
- Including big data processing that requires scalable approaches.
- Validating solutions with Python and standard data science tools and statistical methods.
- Documenting each problem clearly, providing realistic business context and verified solutions.
Requirements
- At least 5 years of hands-on data science experience with measurable business results.
- A portfolio of completed projects or publications showing real-world problem solving.
- Expertise in Python for data science (including pandas, NumPy, SciPy, scikit-learn, and statsmodels).
- Deep understanding of statistical analysis and machine learning, including practical algorithms and applications.
- Advanced SQL skills and experience with database operations for analysis and data manipulation.
- Background in Generative AI technologies (such as LLMs, RAG, prompt engineering, vector databases).
- Understanding of MLOps and model deployment practices.
- Familiarity with frameworks like TensorFlow, PyTorch, or LangChain.
- Strong written English skills at C1 level or above.
Application Process
- Apply
- Complete qualifications
- Join a project
- Perform tasks
- Receive payment
This freelance data scientist role is remote, based in Lyon, Auvergne-Rhône-Alpes, France.
