C

Lead AI Engineer (FM Hosting, LLM Inference)

Capital One
Full-time
On-site
San Jose
$193,400 - $193,400 USD yearly
Overview

Lead AI Engineer (FM Hosting, LLM Inference) at Capital One. We are building responsible and reliable AI systems to reimagine banking for good. Our AI & ML initiatives deliver real-time, personalized customer experiences, with scalable, high-performance AI infrastructure and world-class applied science and engineering teams. Team Description

The Intelligent Foundations and Experiences (IFX) team is at the center of bringing Capital One's AI vision to life. We work with partners across the company to advance science and AI engineering, building and deploying proprietary solutions that deliver value to millions of customers in a responsible and scalable way. In this role, you will

Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One.

Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability.

Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more.

Invent and introduce state-of-the-art LLM optimization techniques to improve performance metrics such as scalability, cost, latency, and throughput for large-scale production AI systems.

Contribute to the technical vision and long-term roadmap of foundational AI systems at Capital One.

The Ideal Candidate

You love to build systems, take pride in the quality of your work, and are motivated to do the right thing. You want to work on problems that will help change banking for good.

You stay current with the latest research and can apply novel techniques in production by understanding scientific publications.

You adapt quickly, bring clarity to big problems, ask questions, and can articulate findings concisely. You share new ideas even when unproven.

You are deeply technical with a strong foundation in engineering and mathematics, and you can identify optimization opportunities across hardware, software, and AI.

You are a resilient trailblazer who can forge new paths to achieve business goals when the route is unknown.

Basic Qualifications

Bachelor’s degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 4 years of experience developing AI and ML algorithms or technologies, or a Master’s degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 2 years of experience developing AI and ML algorithms or technologies

At least 4 years of experience programming with Python, Go, Scala, or Java

Preferred Qualifications

6 years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud)

Experience designing, developing, delivering, and supporting AI services

Experience developing AI and ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, or Golang

Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost

Passion for staying abreast of the latest AI research and AI systems, and judiciously applying novel techniques in production

Capital One will consider sponsoring a new qualified applicant for employment authorization for this position. The minimum and maximum full-time annual salaries for this role are listed below, by location. Salaries are for candidates hired to work within those locations and reflect the amount Capital One is willing to pay at the time of posting. Salaries for part-time roles will be prorated. Cambridge, MA: $193,400 - $220,700; McLean, VA: $193,400 - $220,700; New York, NY: $211,000 - $240,800; San Francisco, CA: $211,000 - $240,800; San Jose, CA: $211,000 - $240,800. Candidates hired to work in other locations will be subject to the pay range of that location, and the actual salary offered will be reflected in the offer letter. This role is eligible for performance-based incentives, including cash bonuses and long-term incentives (LTI), which may be discretionary or non-discretionary depending on the plan. Capital One offers a comprehensive, competitive, and inclusive set of health, financial, and other benefits. Learn more at the Capital One Careers website. Eligibility varies by status and level. This role is open to applications for a minimum of 5 business days. No agencies, please. Capital One is an equal opportunity employer (EOE, including disability/vet) committed to non-discrimination in compliance with applicable laws. Capital One promotes a drug-free workplace and may consider applicants with criminal histories in accordance with applicable laws. If you require accommodations during the application process, please contact Capital One Recruiting at 1-800-304-9102 or via email at RecruitingAccommodations@capitalone.com. For technical support or recruiting questions, please email Careers@capitalone.com. Capital One does not endorse third-party products or services accessed through this site. Capital One Financial is composed of multiple entities; postings may refer to Capital One Canada, Capital One Europe, or COPSSC depending on the region.

#J-18808-Ljbffr