AI Agent Testing Specialist Jobs in United Kingdom

3,658 jobs found

1 - 20 of 3,658 Jobs
Apply
Roboyo logo
Full-time|On-site|London, UK

Roboyo is a trailblazer in the field of Agentic Automation, empowering top-tier brands to incorporate autonomous, AI-driven agents into their workflows, processes, products, and services for enhanced scalability and smarter operations.With a robust foundation in automation, we emphasize the seamless integration of AI within enterprise-level organizations, fo…

Mar 6, 2026
Apply
Lottie logo
Full-time|£60K/yr - £120K/yr|Hybrid|London

Position: Founding Go-to-Market (GTM) Specialist for AI Agents Department: Eliza, Lottie. Reports directly to Chris Hart, GM of Eliza Compensation: OTE £60,000 - £120,000 (Base Salary ranging from £30,000 to £60,000) + Equity Culture & Benefits: Discover more here Equity Offering: Attractive EMI share options Vacation: 26 days plus bank holidays Vision: Transform later life for all and innovate the future of the care sector Work Model: Hybrid, requiring a minimum of 2 office days per week Office Location: London Bridge, London, UK The RoleJoin us in revolutionizing the social care landscape with Eliza, Lottie's cutting-edge AI voice agent. Eliza is designed to manage phone calls for care homes, capture inquiries, book viewings, and ensure every call is addressed. After completing her inaugural shift, we are expanding our reach through our pilot partner network.We are in search of a pioneering GTM professional to launch Eliza into the market. This is an exceptional chance to be the first commercial representative for an innovative AI solution within one of the UK's rapidly expanding startups, supported by Accel and General Catalyst. Collaborate directly with the GM and CEO to construct the go-to-market strategy from the ground up, encompassing the initial outreach to successfully closing deals.If you have thrived as a top-performing SDR or Commercial Associate and aspire to take full ownership of a product's GTM strategy, or if you have experience selling SaaS solutions to small and medium businesses and are ready to embrace a truly innovative AI product, we want to hear from you. We seek a candidate who is entrepreneurial, commercially astute, and eager to introduce transformative solutions to a sector that is in dire need of innovation.As the first in a burgeoning line of AI agent products, this is your opportunity to define the GTM framework that will facilitate expansion across our product portfolio. Key ResponsibilitiesManage the complete sales cycle for Eliza, from prospecting to demonstration and closing deals—targeting care home operators and home care providers.Establish your outbound marketing strategy from the ground up: refine messaging, explore various channels, and utilize Lottie's extensive network of over 6,000 care services for warm introductions.Conduct product demonstrations and represent Eliza at industry events to drive lead generation.Collaborate closely with the Eliza product and engineering teams to relay operator feedback and influence product development.

Mar 10, 2026
Apply
Parloa logo
Full-time|$350/yr - $350/yr|Remote|Dublin; London; Remotely in the UK

YOUR MISSION: As a Senior AI Agent Architect at Parloa, you will be pivotal in revolutionizing customer service through advanced AI Agents. In this dynamic, customer-centric role, you will facilitate the deployment of Parloa's innovative solutions, significantly enhancing the experience for our clients and partners. Utilizing your proficiency in Large Language Models (LLMs), Natural Language Understanding (NLU), and conversational design, you will create efficient AI workflows that ensure optimal performance and quality. Your responsibilities will include designing, prototyping, and validating conversational solutions for AI agent implementations, while guiding clients and partners to maximize the benefits of Parloa’s AI capabilities. IN THIS ROLE YOU WILL: Plan and execute AI Agent deployments, providing strategic insights and hands-on support to customers and partners. Utilize your understanding of LLM internals (e.g., embeddings) to assess customer needs and craft tailored prompts for reliable, user-aligned outcomes. Simplify intricate workflows into manageable conversational elements, empowering LLMs to tackle complex tasks efficiently. Refine conversational interfaces and voice outputs (e.g., SSML, lexicons, regex) to ensure alignment with customer branding. Collaborate closely with Agent Integration Engineers and Forward Deployed Engineers to integrate clients’ systems with Parloa’s Agent Platform through APIs. Identify and address obstacles in collaboration with other departments at Parloa (e.g., Product, Agent Integration Engineering, or Sales) and the client. Implement structured testing methodologies to verify AI agent behavior, quality, and performance in real-world scenarios. Create documentation of best practices, guides, and product features for both internal and external audiences, showcasing the expertise of the Agent Architect team.

Feb 14, 2026
Apply
Foundation Health logo
Full-time|On-site|Manchester

About Foundation HealthAt Foundation Health, we are on a mission to revolutionize the healthcare landscape through our cutting-edge, AI-powered digital pharmacy platform. Our unique approach seamlessly integrates operational infrastructure with exceptional patient experiences, striving for a future where patient-centric care is optimized and frictionless. We refuse to conform to outdated methods, instead setting forth innovative solutions that will define the healthcare practices of tomorrow.To bring this ambitious vision to life, we rely on a dedicated team committed to progress. We believe that a nurturing and stimulating work environment ignites creativity, driving it toward groundbreaking innovations that benefit not just our organization, but also our employees, partners, and, most importantly, our patients.At Foundation Health, we cultivate a culture that encourages our team members to explore new ideas and bring their passion and curiosity to work each day. We recognize that diverse perspectives foster growth, and we actively seek individuals who share our commitment to excellence and forward-thinking.Our Mission: Quality-First HealthcareThe healthcare industry is often bogged down by inefficient workflows. At Foundation Health, we believe that strategically implemented AI can streamline these systems. We are at the forefront of transforming the pharmacy sector by developing a robust, integrated infrastructure that redefines care delivery.As a QA Engineer in our team, you will not merely be ticking boxes; you will serve as a crucial guardian of our platform, directly influencing provider workflows and enhancing patient lives.The Role: Merging Innovation with IntegrityWe are an AI-native team. We don’t just develop AI; we harness it to enhance our productivity and automate routine tasks. We are looking for a QA Engineer who can leverage AI tools to elevate their efficiency and is enthusiastic about the challenge of testing AI-driven systems.In addition, we appreciate traditional tenacity. We need someone who can:Take Ownership: You will have the autonomy to shape our QA culture and standards from the ground up.Be the Gatekeeper: You are unafraid to assert your authority if a product is not ready for launch.

Feb 26, 2026
Apply
Swap logo
Full-time|Hybrid|London

Join Our Team as a Senior Data Scientist - Conversational & Agentic AILocation: London (Hybrid)About SwapAt Swap, we are at the forefront of revolutionizing agentic commerce with our innovative AI-native platform. We connect backend operations seamlessly with a modern storefront experience.Our platform is tailored for brands eager to expand their reach, enabling them to sell anything, anywhere. Swap consolidates global operations, enhances intelligent workflows, and empowers businesses to make informed decisions with real-time data. Our extensive product range includes solutions for cross-border transactions, tax management, returns, demand planning, and next-generation agentic storefronts, providing merchants with full transparency and confidence in their operations.We foster a company culture that values clarity, creativity, and shared ownership as we redefine the landscape of global commerce.About the RoleWe are looking for a Senior Data Scientist to spearhead the development and evaluation of intelligent autonomous AI agents focused on B2B analytics and operational workflows. You will be responsible for engineering sophisticated systems that enable agents to autonomously analyze data, generate actionable insights, execute operational tasks, and orchestrate complex business processes. Your role will involve working across analytical and operational workflows, designing multi-agent ecosystems, establishing evaluation frameworks, and ensuring the reliability of complex agentic systems in production.Key ResponsibilitiesAgentic System Design & Multi-Agent Architecture: Architect AI agent ecosystems capable of autonomously managing analytical and operational workflows, including data exploration, insight generation, process automation, and cross-functional coordination.Analytical & Operational Workflow Design: Create agent systems that proficiently navigate data discovery, hypothesis testing, insight delivery, and operational task execution, effectively managing transitions between specialized agents (SQL, visualization, statistical analysis, workflow automation).Prompt Engineering & Agent Optimization: Craft and refine prompts for autonomous agents, incorporating few-shot learning, chain-of-thought reasoning, tool usage, and structured output generation for multi-step analytical and operational workflows.Comprehensive Evaluation Frameworks: Develop evaluation systems that measure analytical accuracy, operational task success, agent reliability, and coordination effectiveness. Create automated testing suites, benchmark datasets, and continuous production monitoring protocols.Agent Performance Analysis: Conduct thorough analyses of agent performance to drive system improvements and enhance operational efficiency.

Jan 22, 2026
Apply
vega logo
Full-time|On-site|London

About the Role vega is looking for an AI/Agent Engineer in London. This position focuses on designing and building intelligent agents that address complex challenges. Work will directly support the advancement of AI solutions across the company.

Apr 16, 2026
Apply
StarCompliance logo
Full-time|Hybrid|Hybrid

Join Our Innovative Team as a Senior AI Engineer (Agentic Systems)At StarCompliance, we are at the forefront of creating software solutions that address critical compliance needs for our global clientele. As we integrate AI as a fundamental capability throughout our software development lifecycle, we invite you to be part of this transformative journey.We are looking for a Senior AI Engineer to spearhead the practical implementation and expansion of AI-assisted and agentic engineering within our teams. This is a hands-on role where you will engage directly with real codebases, leveraging cutting-edge AI-native development environments (Cursor preferred) to revolutionize the software development process.Your mission will be to evolve AI from a mere tool into a fully embedded system that is repeatable and scalable. You will craft and execute playbooks, establish workflows, and define patterns that allow our teams to harness the power of parallel AI agents, facilitate autonomous code reviews, and create AI-driven delivery pipelines. Additionally, you will play a pivotal role in launching new initiatives, ensuring they are built on the right architecture and AI-enabled engineering practices from the outset.This position is part of our R&D Engineering team and collaborates closely with Platform, QA, and Product Engineering. Here, influence is gained through tangible delivery rather than hierarchy.Our Vision of AIWe view AI not just as an assistant, but as an integral component of our engineering ecosystem. We expect engineers in this role to embrace this perspective and drive our AI initiatives forward.

Mar 31, 2026
Apply
anyone-ai logo
Contract|$40/hr - $40/hr|Remote|United Kingdom

About the RoleAt anyone-ai, we specialize in generating high-quality STEM training data for advanced AI models, utilized in training and evaluation processes at top-tier AI laboratories.We are seeking skilled Chemistry professionals to devise sophisticated, deterministic problems that yield a single verifiable correct answer. These challenges will test the capabilities of contemporary AI systems in deep reasoning, domain knowledge, and computational precision.Projects may encompass areas such as computational chemistry, physical chemistry, analytical chemistry, chemical engineering workflows, and chemistry tasks that necessitate specialized software tools.ResponsibilitiesDesign complex chemistry problems that mirror authentic scientific workflows.Develop deterministic tasks with one correct answer and comprehensive, verified solutions.Craft reasoning-intensive and computationally grounded challenges.Utilize Python and relevant domain-specific chemistry tools or simulation packages as needed.Ensure all tasks are reproducible, technically accurate, and well-documented.Compose clear and precise technical explanations in English.QualificationsBachelor's, Master's, or PhD in Chemistry or a related discipline.Experience in research or industry involving computational or quantitative chemistry workflows.Proficient in Python; familiarity with scientific libraries is advantageous.Strong grasp of modeling, numerical methods, and chemical reasoning.Capability to design original and challenging problems that exceed basic textbook knowledge.Exceptional attention to detail and proficiency in technical writing.Preferred QualificationsExperience with advanced simulation tools or chemical software packages.Prior experience in developing educational materials or assessments in STEM fields.

Mar 31, 2026
Apply
Oxford Dynamics logo
Full-time|Hybrid|Harwell Oxford, England, United Kingdom

Salary: Competitive based on experience.Location: 2-3 days working on-site at our Harwell office, with occasional travel to client sites as needed.Contract Type: Full-time, permanent position with a commitment of 37.5 hours per week.A Note from Our FoundersAt Oxford Dynamics, we are at a pivotal moment in our journey. Operating in some of the most complex and high-stakes domains, including defence, national security, AI, and robotics, the choices we make today will shape our future and growth trajectory.You will collaborate closely with our dedicated team, making impactful decisions and witnessing the results of your work daily. If you thrive in a dynamic environment where ownership, speed, and purpose matter, we encourage you to join us.About Oxford DynamicsEstablished in 2020, Oxford Dynamics is a rapidly growing UK-based deep-tech company focused on developing AI and robotic systems for mission-critical applications. Our flagship AI framework, AVIS (A Very Intelligent System), integrates multi-modal data sources—text, imagery, telemetry, and sensor inputs—empowering operators to analyze complex information swiftly and make informed decisions under pressure. Our STRIDER robotic platform autonomously executes tasks in hazardous environments, ensuring safety while enhancing operational capabilities.Our ambitious goal is clear yet challenging: to merge AI and robotics to create machines capable of perceiving, comprehending, and acting within intricate, real-world settings. We collaborate with global defence and security entities to safeguard nations, infrastructure, and lives.Role Importance and ResponsibilitiesAs a key member of our small, collaborative team, your attitude and approach are as critical as your experience. In your role as a Senior AI Generative Robotics Engineer, you will play a vital part in our success at a crucial stage of our growth.You will be at the forefront of agentic and generative AI, developing systems that transition from lab demonstrations to real-world applications at speed. At Oxford Dynamics, you'll enjoy the freedom to innovate in a fast-paced environment, the responsibility to deliver results, and the chance to influence the functionality of multi-agent AI systems in complex, high-trust contexts.

Apr 9, 2026
Apply
bjakcareer logo
Full-time|On-site|United Kingdom

Join our innovative team at bjakcareer as a Backend Engineer specializing in AI and Agent Systems. In this role, you will be instrumental in designing and developing cutting-edge backend solutions that leverage artificial intelligence to enhance our agent systems. Your expertise will enable us to deliver exceptional user experiences and streamline operations.

Apr 30, 2026
Apply
Anyone-AI logo
Contract|$40/hr - $40/hr|Remote|United Kingdom

Role OverviewJoin our innovative team at Anyone-AI, where we develop high-quality STEM training data for cutting-edge AI models utilized by premier AI research facilities. In this role, you will leverage your expertise in Biology to formulate complex, deterministic problems that yield a single verifiable correct answer, mirroring authentic scientific and analytical workflows.Key ResponsibilitiesCraft advanced biological challenges aimed at enhancing frontier AI systems.Develop deterministic tasks that ensure one correct solution.Provide complete, verified solutions for submitted problems.Construct scenarios that involve experimental reasoning, biological systems, computational analysis, or bioinformatics workflows.Utilize Python and relevant specialized biological or bioinformatics tools as necessary.Maintain exceptional standards of rigor, reproducibility, and technical clarity in your work.QualificationsA Bachelor’s, Master’s, or PhD in Biology or a related life sciences discipline.Experience in research or industry focusing on computational or quantitative biological analysis.Proficient in Python; familiarity with data analysis or bioinformatics workflows is advantageous.Strong analytical reasoning skills and comfort with complex scientific problem-solving.Capability to devise original, challenging problems grounded in real-world biological practices.Excellent written English with keen attention to detail.Preferred QualificationsExperience with molecular biology, genetics, systems biology, computational biology, bioinformatics, or quantitative biology fields.

Mar 31, 2026
Apply
Toloka AI logo
Contract|Remote|Remote — United Kingdom

Role overview This freelance, remote contract with Toloka AI centers on no-code automation and AI training. Based in the United Kingdom, the specialist will build automation solutions and help others learn to use AI tools effectively. What you will do Design and set up no-code automation workflows to make processes more efficient Work with a distributed team to spot areas where automation can help Train users, guiding them to understand and use AI technologies About Toloka AI Toloka AI creates platforms and solutions that blend human expertise with artificial intelligence. The company values practical problem-solving and clear, direct communication.

Apr 27, 2026
Apply
ServiceNow logo
Full-time|On-site|London

Role overview ServiceNow is seeking a Senior AI Agent Engineer to join the Customer Deployment team in London. The position centers on building and deploying advanced AI agent solutions that support clients in meeting their objectives. Deep experience in artificial intelligence and engineering is essential, especially in the context of customer deployment strategies. What you will do Design, develop, and maintain AI agent systems tailored to customer requirements Collaborate with cross-functional teams to ensure effective implementation of AI solutions Refine and enhance AI capabilities to improve user experience and drive customer satisfaction Collaboration and impact This position sits at the intersection of engineering and customer success. The Senior AI Agent Engineer will work with teams across ServiceNow, helping deliver reliable and effective AI deployments that create tangible value for clients.

Apr 21, 2026
Apply
Roku, Inc. logo
Full-time|On-site|Cambridge, United Kingdom

Collaborate to Enhance Streaming Experiences. Join Roku in Revolutionizing Television ViewingAs the leading TV streaming platform across the U.S., Canada, and Mexico, Roku is at the forefront of transforming how the world engages with television content. Our mission is to connect consumers with their favorite entertainment while empowering content creators and offering advertisers unique engagement opportunities.From day one at Roku, your contributions will be significant and recognized. This dynamic, fast-growing public company invites you to help us delight millions of streamers globally while gaining valuable experience in various technical disciplines. About The Role With a global user base, our products are renowned for their exceptional ease of use and reliability. This seamless experience is a direct result of the dedication employed by the Roku OS QA Team, whose mission is to ensure the highest quality in our streaming media platform. We strive to help users discover and enjoy their favorite content effortlessly, utilizing cutting-edge technologies and strong engineering principles. About the TeamIn this role, you will utilize your diverse skill set to assist both Software and QA Engineers, addressing daily challenges with innovative solutions. Your expertise, particularly in AI, will play a crucial role in enhancing product quality and optimizing workflows to consistently exceed user expectations. Your ResponsibilitiesContribute to the development of tools and technologiesDesign, build, and maintain AI-driven automation systemsCollaborate with cross-functional teams including data scientists, software engineers, and product teams to deliver AI-based solutionsDevelop new tools and technologies to enhance testing processesIdentify and debug failing tests to improve product reliability

Apr 30, 2026
Apply
Endava logo
Full-time|On-site|London

Join Endava as a Solution Developer, specializing in Agentic AI. This entry-level position presents a unique opportunity for recent graduates and aspiring developers to work in a dynamic client delivery environment. You will be involved in innovative projects that challenge your programming skills and creativity, while collaborating with industry experts.

Apr 30, 2026
Apply
Cresta logo
Full-time|Remote|United Kingdom (Remote)

Cresta is dedicated to transforming customer interactions into a strategic advantage by harnessing the full potential of contact centers. By intelligently blending AI and human insights, our platform empowers contact centers to uncover valuable customer insights, streamline operations through automation, and enhance the productivity of every team member. With roots in the esteemed Stanford AI lab, Cresta was co-founded by Sebastian Thrun, the visionary behind Google X, Waymo, and Udacity. Our leadership team also includes CEO Ping Wu, co-founder of Google Contact Center AI and Vertex AI, and Tim Shi, an early member of OpenAI.Join us on an exhilarating journey to reshape the workforce through AI technology. The future of work is here, and it begins at Cresta.

Mar 2, 2026
Apply
anyone-ai logo
Contract|$0/hr - $40/hr|Remote|United Kingdom

Role Overview anyone-ai develops high-quality STEM training data for AI models, supporting research and evaluation at leading AI labs. The team focuses on data that strengthens technical reasoning in artificial intelligence. This remote contract position is open to candidates based in the United Kingdom. What You Will Do Design advanced physics problems to challenge and evaluate AI systems. Ensure each problem is deterministic, with a single, clearly correct answer. Write full solutions that document each reasoning step and verify correctness. Create problems that test deep understanding and multi-step analytical thinking, not just recall. Use Python or other specialized tools as needed for simulations, modeling, or computational tasks. Maintain technical accuracy, reproducibility, and clear English in all materials.

Apr 17, 2026
Apply
Swap logo
Full-time|On-site|London

At Swap, we are revolutionizing the landscape of modern agentic commerce through our innovative AI-native platform. Our technology seamlessly integrates backend operations with an advanced storefront experience, empowering brands to sell anything, anywhere.Designed for forward-thinking brands, Swap centralizes global operations, enhances intelligent workflows, and enables margin-protecting decisions utilizing real-time data. Our comprehensive product suite encompasses cross-border logistics, tax management, returns processing, demand planning, and our pioneering agentic storefront, ensuring merchants have full transparency and the confidence to act decisively.We are committed to fostering a culture that prioritizes clarity, creativity, and shared ownership as we redefine the dynamics of global commerce.About the RoleWe are seeking an extraordinary individual who transcends traditional data science. Our ideal candidate embodies: • A blend of human intuition and machine intelligence. • The ability to run parallel code paths and deploy agents to explore solutions autonomously.Join our Agentic squad, collaborating with five other AI and data science experts to refine, scale, and innovate the intelligence layer that powers our agentic storefront. This senior-level, high-agency generalist role is for someone who can navigate models, agents, experiments, and product decisions with agility and speed in a startup setting.Key ResponsibilitiesWork in tandem with Product, Engineering, and the Agentic team to enhance AI-driven features throughout the entire commerce journey.Design and refine recommendation and personalization systems leveraging behavioral and transactional data.Advance conversational agents, multi-agent workflows, and tool integrations.Iterate on virtual try-on and vision systems using prompt engineering, fine-tuning, and model chaining.Rapidly prototype new AI functionalities and transition them into production-grade systems.Establish and enhance experimentation frameworks, including A/B testing, evaluation pipelines, and benchmarking.Drive improvements in conversion, retention, and operational efficiency through applied machine learning.Identify automation opportunities and design agent-driven internal tools.

Mar 2, 2026
Apply
WNS Global Services logo
Full-time|Hybrid|London

Join WNS Global Services as an AI Test Lead and play a pivotal role in shaping the future of AI solutions. As a leader in our AI Foundry, you will spearhead testing strategies and ensure the highest quality of AI products. This position offers a unique opportunity to work in a hybrid environment, requiring just three days in the office each week.In this role, you will collaborate with cross-functional teams to develop test plans, execute test cases, and validate AI models. You will also mentor junior testers and contribute to the continuous improvement of testing methodologies.

Mar 24, 2026
Apply
Checkout.com logo
Full-time|On-site|London

About Checkout.com Checkout.com powers digital payments for well-known brands such as eBay, ASOS, Klarna, Uber Eats, and Sony. Our technology supports billions of online transactions each year, helping businesses deliver smooth checkout experiences that drive growth and customer loyalty. With a presence in 19 offices across six continents and headquarters in London, our team brings together a global perspective. Every role here offers a chance to shape the future of fintech alongside colleagues committed to high standards and ongoing improvement. Role Overview: Manager, Product Marketing for AI and Agentic Commerce The Manager, Product Marketing for AI and Agentic Commerce will guide product marketing strategies at Checkout.com. This role works closely with Product, Commercial, and Marketing teams to deliver go-to-market plans and shape how we communicate our leadership in this evolving space. Expect to work hands-on with new technologies and trends, producing product marketing outcomes that support a range of initiatives. Key Responsibilities Lead product marketing for AI and agentic commerce, turning complex features into clear, engaging stories. Own go-to-market strategies: manage product launches, marketing campaigns, and content development, ensuring delivery meets business goals and timelines. Shape category positioning by defining and strengthening Checkout.com’s presence in agentic commerce and AI. Location This position is based in London.

Apr 15, 2026

Sign in to browse more jobs

Create account — see all 3,658 results

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.