Join Capital One as a Lead AI Engineer specializing in Large Language Models, where you'll be at the forefront of designing, developing, and supporting cutting-edge AI systems. Our mission is to innovate banking for good by leveraging advanced AI techniques to enhance customer experiences and streamline internal operations.
About Us:
At Capital One, we are committed to creating responsible and reliable AI systems that redefine the banking landscape. As industry leaders in harnessing machine learning for personalized customer experiences, we have built a robust technology infrastructure supported by top-tier talent. Our AI applications transform customer interactions, from real-time responses to addressing unusual charges, by injecting humanity and simplicity into banking.
Team Overview:
The Intelligent Foundations and Experiences (IFX) team plays a pivotal role in actualizing our AI vision. We collaborate closely with various departments to enhance the state-of-the-art in AI science and engineering. Our proprietary solutions deliver significant value to millions of customers and enable different teams across Capital One to leverage the transformative potential of AI responsibly and efficiently.
Your Role:
Collaborate with a diverse team of engineers, research scientists, and product managers to develop AI-powered products that improve both customer interactions and associate workflows.
Design, develop, test, deploy, and maintain essential AI components, including foundation model training, large language model inference, similarity search, model evaluation, and governance.
Utilize an extensive array of Open Source and SaaS AI technologies, including AWS Ultraclusters, Hugging Face, VectorDBs, and PyTorch.
Innovate and implement state-of-the-art optimization techniques to enhance performance, scalability, and efficiency of large-scale AI systems.
Contribute to the technical vision and long-term roadmap for foundational AI systems at Capital One.
Who You Are:
You are passionate about building high-quality systems that make a difference, focusing on challenging problems that contribute to the greater good.
You keep up with the latest research in AI and ML, applying cutting-edge techniques pragmatically in production settings.
You thrive in ambiguity, enjoy uncovering root causes, and communicate your insights effectively.
With a strong technical foundation in engineering and mathematics, you spot optimization opportunities and leverage them creatively.
You are a resilient innovator who can navigate uncharted territories to achieve business objectives.
Qualifications:
Bachelor's or Master's degree in Computer Science, AI, Electrical Engineering, or related fields, along with relevant experience in AI and ML development.
Experience programming in Python, Go, Scala, or Java.
Preferred Skills:
At least 4 years of experience deploying scalable AI solutions on cloud platforms (AWS, Google Cloud, Azure, or equivalent).
Experience in developing and supporting AI services and algorithms, particularly in LLM inference and memory optimizations.
Expertise in state-of-the-art techniques to optimize software for improved latency, throughput, and cost efficiency.
This role is based in one of the following locations: Cambridge, MA; McLean, VA; New York, NY; San Francisco, CA; San Jose, CA.
The salary for this position ranges from $158,600 to $197,400 depending on your location. Additionally, performance-based incentives and comprehensive benefits packages are included.
Capital One is an equal opportunity employer that promotes diversity and inclusion. We welcome applicants from all backgrounds and are committed to providing a drug-free workplace.