About Us
Benchmark is the AI platform for the world's best investment firms. Leading firms use Benchmark to work faster and smarter across their entire deal lifecycle — from sourcing to diligence to portfolio management.
The Role
You'll build AI-native features end-to-end, from design through production. That means working directly with our LLM infrastructure — retrieval, context management, memory, embeddings, evals — and turning it into product experiences that feel effortless to users. We believe AI should be a teammate, not a copilot. We're a small team shipping ambitious products, so you'll have real ownership from day one.
Things You Would Work On
Architect and ship new product features that help investment professionals move faster, with a focus on removing complexity and enabling collaboration across deal teams
Build on our LLM stack: run evals, improve retrieval, context management, and memory, and integrate model capabilities into user-facing workflows
What We're Looking For
3+ years of experience building and shipping production applications
Genuine interest in agents and a habit of keeping up to date with current research and model capabilities
Self-motivated, with high ownership, low ego, and the ability to work through ambiguity
Excited to work in-person in our NYC office 4+ days/week
Bonus
Experience working with agents in production
Previous experience as a founder or at an early-stage startup
Tech Stack
Backend: Python, Flask, Postgres · Frontend: TypeScript, React · Infra: GCP
Why Us
We're a small, technical team where everyone in every role builds. We work in-person because we're trying to do something hard with a lean team, and the velocity we get from being together matters. If that excites you, we'd love to talk.