O

Senior AI Engineer (6a-3p PT)

Orion
Full-time
On-site
San Francisco
$163,188 - $163,188 USD yearly
​​​About this Opportunity: At Orion, we’re building the intelligent infrastructure that powers modern financial advisory. As part of our mission to unify planning, investing, and client engagement, we’re looking for a

Senior AI Engineer

to rapidly deploy cutting-edge large language models (LLMs) and generative AI into production, powering scalable systems and customer experiences. This is a hands-on, product-facing role focused on shipping working AI features. You’ll work closely with product managers, designers, and engineers to build systems that directly impact thousands of advisors and millions of investor accounts. For External Candidates: Candidates must work in-office at one of the following locations for at least 3 days per week: San Francisco, CA. For Internal Candidates: ll internal employees, regardless of their current work arrangement (remote or in-office), are encouraged to apply. In this role, you’ll get to: Integrate LLMs and generative AI into advisor facing products and workflows

Design and build RAG systems using internal and external data sources

Apply techniques like prompt engineering, fine-tuning (e.g. LoRA), and custom embeddings to optimize domain-specific performance

Evaluate and productionize open-source and proprietary models (GPT-4, Claude, LLaMA, Mistral, etc.)

Develop APIs and services to deliver AI- powered features at scale

Collaborate across product and engineering teams to deliver rapidly and reliably

Continuously measure AI feature quality: accuracy, latency, and user impact

We’re looking for talent who: Has proven experience working with large language models (e.g. OpenAI GPT, Claude, LLaMa)

Has familiarity with embedding models, understanding tokenization, prompt engineering and fine-tuning (e.g., LoRA).

Has practical experience with at least 1 vector database: Pinecone, Weaviate, FAISS

Has strong proficiency with orchestration tools like LangChain, Haystack or LlamaIndex for building RAG systems

Has ability to design and implement retrieval-augmented generation (RAG) systems.

Has experience in data preprocessing, chunking and vectorization pipelines

5+ years in software engineering, with 2+ years in applied ML or AI

Has deep understanding of LLMs, embeddings, RAG architecture, and vector search

Has strong grasp of prompt design, fine-tuning strategies, and model evaluation

Has proficiency with tools such as: LangChain, LlamaIndex, Pinecone, Weaviate, OpenAI, Hugging Face, FastAPI, Docker

Has strong engineering discipline and communication skills, especially in cross-functional settings

#LI-AP1 #LI-Onsite #LI-Hybrid Salary Range: $163,188.00 - $262,732.00 The pay listed in this posting indicates the estimated pay at the time of this posting; however, may vary depending on geographic location, job-related knowledge, skills, and experience. In addition, Orion offers a competitive benefits package which includes health, dental, vision, and disability coverage on day one, 401(k) plan with employer match, paid parental leave, pet benefits including pawternity leave and pet insurance, student loan repayment and more.