Design and implement LLM-powered systems and scalable agentic workflows in production.
Build robust data processing pipelines for evaluation datasets, and feedback loops.
Work closely with customers to deeply understand their workflows and deliver products that meaningfully impact their systems
Apply novel research ideas and build production systems that advance the frontier of self-improving AI systems
Contribute to technical blogs and research implementations that strengthen our technical foundation
What we look for:
Strong experience building large-scale LLM-powered or agentic systems in production environments
Strong research taste, especially in evaluation design, post-training methods, and model improvement workflows
Desire to own problems end-to-end and deliver measurable outcomes
Ability to work directly with customers
Obsession with detail, code quality, and clean, modular system design
Strong ownership mentality and ability to thrive in a fast-paced startup environment
Currently pursuing Bachelor's, Master's or PhD degree in Computer Science, Engineering, or a related field
Our Core Values:
Customer Obsession - We start with the customer and work backwards. We aim to earn trust through consistent delivery, thoughtful listening, and by obsessing over customers.
Intellectual Honesty - We operate with high trust and low ego. Ideas matter more than titles, and we communicate openly and directly while assuming good intent, even in strong disagreement.
Bias for Action - We set high standards and move quickly to meet them. We prefer building and learning with customers over debating in the abstract, and we iterate based on real feedback.
Extreme Ownership - We take responsibility for outcomes, not just tasks. Ownership means seeing problems through to completion and ensuring solutions truly work in practice.
In this role, you'll work closely with the NeoSigma team. This role has a natural path towards full-time conversion.
About Us
NeoSigma is a product-driven research lab building the intelligence layer that helps close the feedback loop between your customers, products, and AI systems.
We are a small, intensely technical team of researchers and engineers who have trained frontier-scale models and widely used AI products and agents at Parallel Web, Essential AI, MIT, Apple, and Amazon.