Senior/Staff Applied AI Engineer, Agents
Scale AI
About The ACE Team
The Agent Capabilities & Environments (ACE) team, part of Scale’s Research organization, brings together customer-facing Researchers and Applied AI Engineers. Our core mission includes benchmarking autonomous agent performance across real-world scenarios and environments, creating robust data programs to improve Large Language Models (LLMs) agentic capabilities, and building foundational tools and frameworks for evaluating models as agents. ACE focuses on autonomous agents that dynamically interact with diverse external environments, including code repositories, GUI interfaces, browsers, and more.
About The Role
As a Senior/Staff Applied AI Engineer on the ACE team, you’ll play a crucial role bridging state-of-the-art generative AI research, practical agent development, and the specialized data required to advance agentic systems.
You will:
Develop frameworks and tools to benchmark and evaluate advanced agent capabilities.
Construct realistic environments for training and evaluating autonomous agents.
Design agent-focused data programs leveraging supervised fine-tuning (SFT) and reinforcement learning (RL) methodologies.
Create robust data pipelines and novel agentic data types from diverse environments, including code repositories, web browsers, and computer systems.
Collaborate closely with customers to understand requirements, guide model development, and achieve product objectives.
Implement and adapt popular open-source agent libraries and benchmarks using proprietary datasets and models.
Responsibilities
Develop and apply frameworks to benchmark autonomous agent capabilities in complex environments.
Build and maintain data programs supporting agentic model development and evaluation.
Translate customer requirements into actionable development plans with clear milestones.
Qualifications
Min. 5+ years of practical experience building AI applications for real-world use cases.
Strong engineering and AI fundamentals, supported by a Bachelor’s and/or Master’s degree or equivalent experience in Computer Science, Machine Learning, AI, or a closely related field.
Deep understanding of modern deep learning methods, LLM technologies, and data-centric AI methodologies.
Proven proficiency in Python, with experience writing, testing, and debugging code using standard data science libraries (e.g., NumPy, Pandas).
Previous experience in customer-facing roles, effectively translating complex requirements into actionable development goals.
Passion for solving ambiguous, complex technical challenges using cutting-edge research.
Nice-to-haves
Hands-on experience developing AI applications within the modern Generative AI stack (OpenAI APIs, commercial or open-source LLMs).
Experience building autonomous agents that leverage external tools, produce structured outputs, and interact with various environments.
Familiarity with agent benchmarking datasets such as SWE-Bench, tau-bench, and OS-World.
About Us
At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status. We are an inclusive workplace and provide reasonable accommodations to applicants with disabilities upon request.
We comply with applicable laws and policies surrounding privacy and pay transparency. For more information, please contact accommodations@scale.com or refer to our privacy policy.
#J-18808-Ljbffr