Beam
is an ultrafast AI inference platform. We built a serverless runtime that launches GPU-backed containers in less than 1 second and quickly scales out to thousands of GPUs. Developers use our platform to serve apps to millions of users around the globe. We're backed by Y Combinator, Tiger Global, and prominent developer-tool founders, including the founder of Snyk and former CTO of GitHub.
About the Role
In this role, you'll build full-stack AI apps with our platform. You’ll build examples, demos, and sharable mini-apps that showcase the most interesting capabilities of AI — and you’ll use our infrastructure to do it. You’ll also optimize inference performance for a wide range of models running on our platform. You will minimize latency, maximize throughput, and experiment to make sure the apps running on our platform have industry-leading performance.
Your work will directly impact millions of users worldwide.
Skills & Experience
Familiar with modern AI workflows, like ComfyUI and LoRA adaptors for fine-tuning
Able to ship full-stack web apps, ideally using a modern stack like Python/Django and React/Next.js
You’ve built something from scratch in the past, from wireframes to launching it publicly
Interest in the modern AI/ML landscape: you’re experimenting with the newest models as soon as they’re released
Experience with the modern inference stack (e.g., PyTorch, TensorRT, vLLM)
Benefits
Work on challenging and impactful engineering problems
Competitive salary and meaningful equity
Join a fast-growing pre-Series A company at the ground floor
Health, dental, and vision benefits with 100% coverage for employees and 50% for dependents
Opportunities to participate in events across the cloud-native and AI communities
Fitness stipend, learning budget, and much more
#J-18808-Ljbffr