Applied Researcher (Product) - AI Safety Solutions
Apollo Research, London
On-site, Full-time
Qualifications
Experience in empirical research and a strong understanding of AI safety mechanisms. Proficiency in coding and familiarity with AI systems. Ability to work collaboratively within a team and adapt to evolving challenges in a dynamic environment.
Application Deadline: We are actively interviewing candidates and aim to fill this position as soon as we find a suitable applicant.
THE OPPORTUNITY
Join our AGI safety product team and play a key role in turning complex AI research into practical tools that reduce AI-related risks. As an applied researcher, you will work closely with our CEO (who also acts as Head of Product), product engineers, and the Evals team’s software engineers to develop solutions that make AI agents safer for our clients. We are currently focused on monitoring AI coding agents to identify safety and security failures. You will be part of a small team, so you can significantly influence both team dynamics and technical approaches while quickly taking on greater responsibility.
This position is a good fit if you want to use empirical research methods to make AI systems safer in practical applications. If you enjoy turning theoretical AI risks into concrete detection mechanisms, thrive in fast-paced environments, and want your research to have a meaningful impact on real-world AI safety, we would love to hear from you.
KEY RESPONSIBILITIES
Research & Development
- Collect and catalog coding agent failure modes systematically from real-world instances, public examples, research literature, and theoretical predictions.
- Design and execute experiments to evaluate monitor effectiveness across various failure modes and agent behaviors.
- Develop and maintain evaluation frameworks to track advancements in monitoring capabilities.
- Refine monitoring strategies based on empirical findings, optimizing detection accuracy alongside computational efficiency.
- Stay updated with the latest research in AI safety, agent failures, and detection methodologies.
- Keep abreast of advancements in coding security and safety vulnerabilities.
Monitor Design & Optimization
- Create a comprehensive library of monitoring prompts tailored to specific failure modes (e.g., security vulnerabilities, goal misalignment, deceptive behaviors).
- Experiment with various reasoning strategies and output formats to enhance monitor reliability.
- Design and evaluate hierarchical monitoring architectures and ensemble approaches.
About Apollo Research
Apollo Research is at the forefront of AI safety, dedicated to creating innovative products that minimize risks associated with artificial intelligence. Our mission is to make AI safety accessible and effective for all users, ensuring a secure future powered by intelligent technology.
