About the job
Join us in building a robust, verifiable evaluation suite of Terminal-Bench tasks that pushes large language models on multilingual software challenges. Our goal is to measure multilingual resilience: how the prompt language affects performance, how well models process non-English data, and how they handle complex locale and encoding scenarios in terminal workflows.
We are looking for skilled native-speaking software engineers to design, build, and validate these benchmarks. You will create high-quality, impactful tasks that evaluate a model's proficiency in multilingual contexts without relying on English translations.
Note: This is a remote, freelance opportunity.
Key Responsibilities
- Task Engineering: Design and build Terminal-Bench tasks that assess coding agents in multilingual terminal environments.
- Asset Creation: Develop realistic task environments using datasets and files in your native language. These assets must remain in the target language to evaluate multilingual capability accurately (see the first sketch after this list).
- Prompting & Translation: Craft prompts in your native language and identify the failure points where models struggle with it.
- Implementation & Verification: Help create robust reference solutions and write highly reliable verifier scripts, resorting to rubric-based judging only when absolutely necessary (see the second sketch after this list).
- Calibration & Execution: Analyze execution logs and adjust task complexity (ranging from Easy to Very Hard) using standard Terminal-Bench run configurations across various model tiers (Haiku, Sonnet, Opus).
- Quality Assurance: Engage in a meticulous, four-layer human quality control process (creation, human review, calibration review, and audit) combined with automated LLM-based checks to uphold fairness, grammatical precision, and benchmark integrity.
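To give a concrete sense of the asset-creation work, here is a minimal, hypothetical sketch of how a task author might generate a small Spanish-language dataset whose accented characters and decimal-comma numbers exercise locale and encoding handling. The file name, columns, and values are invented for illustration and are not part of any existing task.

```python
# Hypothetical asset-generation sketch: writes a small Spanish-language CSV
# whose accented characters and decimal-comma numbers exercise locale handling.
# All names and figures below are illustrative only.
import csv

rows = [
    ("ciudad", "población", "superficie_km2"),
    ("Málaga", "578.460", "398,25"),     # Spanish thousands dot / decimal comma
    ("Córdoba", "322.071", "1.254,25"),
    ("Cádiz", "111.811", "12,10"),
]

# Keep the asset in the target language and in UTF-8, so the task genuinely
# tests multilingual handling rather than an English translation.
with open("ciudades.csv", "w", encoding="utf-8", newline="") as f:
    csv.writer(f).writerows(rows)
```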
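And a minimal, hypothetical sketch of a deterministic verifier, assuming a pytest-style check over files the agent produces inside the task environment; the output path, column names, and expected row count are assumptions made for this example, not a prescribed Terminal-Bench interface.

```python
# Hypothetical verifier sketch: checks that the agent produced a UTF-8 CSV report
# with the expected target-language header and row count. Paths and expected
# values are illustrative only.
import csv
import pathlib

OUTPUT = pathlib.Path("/app/output/report.csv")  # hypothetical task output path


def test_report_exists():
    assert OUTPUT.exists(), "agent did not produce the expected report"


def test_report_is_valid_utf8_csv():
    # Decoding explicitly as UTF-8 catches mojibake introduced by wrong locale settings.
    with OUTPUT.open(encoding="utf-8", newline="") as f:
        rows = list(csv.reader(f))
    assert rows, "report is empty"
    assert rows[0] == ["ciudad", "población"], "unexpected header (target-language columns)"


def test_row_count():
    with OUTPUT.open(encoding="utf-8", newline="") as f:
        rows = list(csv.reader(f))
    assert len(rows) - 1 == 10, "expected exactly 10 data rows"
```

Deterministic checks like these keep verification reproducible; as noted in the responsibilities above, rubric-based judging remains a last resort.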
