Turnitin, LLC logoTurnitin, LLC logo

Senior AI Data Engineer - Remote (UK)

Turnitin, LLCManchester
Remote Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Experience Level

Mid to Senior

Qualifications

Required Qualifications:Minimum of 4 years of experience in data engineering, especially in AI/ML data infrastructure or accelerating AI R&D. Expertise in Python, SQL, and Infrastructure as Code (Terraform, CloudFormation), along with experience in modern orchestration frameworks (Airflow, Prefect, or dbt). Proficient in cloud-native data platforms (AWS, Azure, GCP) and vector databases (Pinecone, Weaviate, Qdrant, or Chroma). Familiarity with MLOps tools and platforms (HuggingFace, SageMaker Bedrock, Vertex AI), and experiment tracking (MLflow, Weights & Biases). Experience with Large Language Models (LLMs), embedding generation, retrieval-augmented generation (RAG) systems, and frameworks for orchestrating LLM interactions (LiteLLM, LangFuse, LangChain, LlamaIndex). Strong problem-solving, analytical, and communication skills, with a proven ability to work collaboratively.

About the job

At Turnitin, we recognize that AI and data science are fundamental to our achievements and ambitious product strategy. As a Senior AI Data Engineer, you will join a dynamic global team of proactive and independent professionals dedicated to crafting sophisticated, well-structured AI and data systems. You will be at the forefront of developing our next-generation data and AI pipelines, significantly scaling our team's impact. You'll collaborate across various teams within Turnitin to integrate AI and data science into a diverse range of products aimed at enhancing learning, teaching, and academic integrity.

Key Responsibilities:

  • AI Data Infrastructure & Pipeline Management: Design, build, and operate scalable real-time data pipelines that facilitate ongoing Applied AI model training. Implement and maintain robust data infrastructure utilizing AI techniques and engineering best practices to ensure continuous model improvement.
  • Data Collection: Lead efforts to collect, normalize, and store data from various sources, including external LLM providers.
  • Collaboration: Work closely with AI R&D, Applied AI, and Data Platform teams to ensure smooth data flow and adherence to quality standards. Collaborate with stakeholders to curate and catalog high-quality datasets that support Applied AI retraining workflows and business goals.
  • Support for AI R&D: Contribute to AI Research & Development initiatives by leveraging advanced data warehousing and engineering technologies. Engage in exploratory data projects to extract insights from Turnitin's extensive datasets.
  • Communication: Foster clear communication across teams, aligning with the company vision while sharing insights on data infrastructure requirements and potential innovations.
  • Technology Evolution: Stay updated with emerging tools and methodologies in AI data engineering, providing recommendations to enhance our AI data infrastructure and capabilities.

About Turnitin, LLC

Turnitin is a leading innovator in the global education sector, dedicated to promoting academic integrity and supporting educational institutions for over 25 years. With a user base of over 21,000 academic institutions, publishers, and corporations, our services include Feedback Studio, Originality, Gradescope, ExamSoft, Similarity, and iThenticate. We offer a remote-centric culture that empowers you to work with purpose and accountability, supported by a comprehensive benefits package focused on your overall well-being. Our diverse team, spread across more than 35 countries, is united by a shared mission to make a transformative impact in education.

Similar jobs

Browse all companies, explore by city & role, or SEO search pages. View directory listings: all jobs, search results, location & role pages.

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.