About the job
Join our innovative team at Lilt, where we are building a comprehensive suite of Terminal-Bench evaluation tasks designed to push the boundaries of large language models on multilingual software challenges. Our mission is to accurately assess multilingual robustness, focusing on prompt-language effects, non-English data processing, and intricate locale/encoding edge cases within terminal workflows.
We are looking for skilled native-speaking software engineers to design, build, and validate these benchmarks. You will create high-quality, impactful tasks that authentically evaluate a model's ability to navigate multilingual settings without relying on English translations.
This is a remote, freelance position.
Target Languages: Spanish, German, Czech, Turkish, Arabic (Egyptian), Korean, Japanese, Hausa, Hindi, Marathi.
Key Responsibilities:
- Task Engineering: Assess and help improve coding agents through well-designed benchmark tasks.
- Asset Creation: Develop realistic task scenarios using datasets and files in your native language. It is essential that these assets remain in the target language to adequately evaluate multilingual capabilities.
- Prompting & Translation: Identify failure points in model performance when prompting in your native language.
- Implementation & Verification: Assist in developing reliable solutions (reference implementations) and write highly accurate, deterministic verification scripts, resorting to rubric-based judging only when absolutely necessary.
- Calibration & Execution: Review execution logs and calibrate task difficulty (from Easy to Very Hard) using standard Terminal-Bench configurations against various model tiers (Haiku, Sonnet, Opus).
- Quality Assurance: Engage in a rigorous, four-layer human quality control process (creation, human review, calibration review, and audit) alongside automated LLM-based checks to ensure fairness, grammatical accuracy, and the integrity of benchmarks.
