About this Opportunity:
At Orion, we’re building the intelligent infrastructure that powers modern financial advisory. As part of our mission to unify planning, investing, and client engagement, we’re looking for a
Senior AI Engineer
to rapidly deploy cutting-edge large language models (LLMs) and generative AI into production, powering scalable systems and customer experiences.
This is a hands-on, product-facing role focused on shipping working AI features. You’ll work closely with product managers, designers, and engineers to build systems that directly impact thousands of advisors and millions of investor accounts.
For External Candidates:
Candidates must work in-office at one of the following locations for at least 3 days per week: San Francisco, CA.
For Internal Candidates:
ll internal employees, regardless of their current work arrangement (remote or in-office), are encouraged to apply.
In this role, you’ll get to:
Integrate LLMs and generative AI into advisor facing products and workflows
Design and build RAG systems using internal and external data sources
Apply techniques like prompt engineering, fine-tuning (e.g. LoRA), and custom embeddings to optimize domain-specific performance
Evaluate and productionize open-source and proprietary models (GPT-4, Claude, LLaMA, Mistral, etc.)
Develop APIs and services to deliver AI- powered features at scale
Collaborate across product and engineering teams to deliver rapidly and reliably
Continuously measure AI feature quality: accuracy, latency, and user impact
We’re looking for talent who:
Has proven experience working with large language models (e.g. OpenAI GPT, Claude, LLaMa)
Has familiarity with embedding models, understanding tokenization, prompt engineering and fine-tuning (e.g., LoRA).
Has practical experience with at least 1 vector database: Pinecone, Weaviate, FAISS
Has strong proficiency with orchestration tools like LangChain, Haystack or LlamaIndex for building RAG systems
Has ability to design and implement retrieval-augmented generation (RAG) systems.
Has experience in data preprocessing, chunking and vectorization pipelines
5+ years in software engineering, with 2+ years in applied ML or AI
Has deep understanding of LLMs, embeddings, RAG architecture, and vector search
Has strong grasp of prompt design, fine-tuning strategies, and model evaluation
Has proficiency with tools such as: LangChain, LlamaIndex, Pinecone, Weaviate, OpenAI, Hugging Face, FastAPI, Docker
Has strong engineering discipline and communication skills, especially in cross-functional settings
#LI-AP1
#LI-Onsite
#LI-Hybrid
Salary Range:
$163,188.00 - $262,732.00 The pay listed in this posting indicates the estimated pay at the time of this posting; however, may vary depending on geographic location, job-related knowledge, skills, and experience. In addition, Orion offers a competitive benefits package which includes health, dental, vision, and disability coverage on day one, 401(k) plan with employer match, paid parental leave, pet benefits including pawternity leave and pet insurance, student loan repayment and more.