Compensation: $240,000 - $275,000 base + bonus up to 80%
We are seeking an experienced engineering leader to take ownership of a high-impact team building infrastructure for AI systems using Rust and modern cloud-native technologies.
In this role, you will lead the development of scalable platforms to support the full lifecycle of AI and machine learning models, from training and orchestration to deployment and monitoring. You will manage a senior engineering team and collaborate closely with researchers, data scientists, and infrastructure stakeholders to deliver robust, enterprise-grade solutions.
This is a leadership position in a highly technical, fast-moving environment, ideal for someone who thrives at the intersection of systems programming, distributed infrastructure, and AI enablement.
Responsibilities
Lead a senior engineering team focused on AI platform development using Rust and modern cloud tooling
Architect and deliver scalable systems for AI model training, inference, and lifecycle management
Build and maintain secure, compliant cloud-native infrastructure across AWS, GCP, and Azure
Develop internal APIs and platform abstractions for cross-functional ML teams
Implement CI/CD pipelines, monitoring, and automation for highly available AI systems
Collaborate with research and data science teams to bring experimental models into production
Guide platform direction in areas like GPU orchestration, distributed systems, and MLOps
Required Experience
10+ years in software or platform engineering, with a track record of technical leadership
Strong programming expertise in Rust
Deep experience with Kubernetes, including Helm, CRDs, and container runtimes
Familiarity with infrastructure-as-code tools such as Terraform or CloudFormation
Hands-on experience with cloud platforms (AWS, Azure, or GCP)
Knowledge of production-grade ML frameworks and numerical libraries such as PyTorch, JAX, or NumPy
Strong foundation in distributed systems, cloud security, and networked application design
Experience managing CI/CD pipelines and cloud-native operations at scale
Preferred Qualifications
Experience with GPU orchestration and high-performance computing environments
Background in building SDKs, APIs, or internal developer platforms
Familiarity with distributed data processing tools such as Spark or Dask
Knowledge of event-driven architectures and technologies like Kafka
Exposure to MLOps tools and concepts
Compensation and Benefits
Base salary of $240,000 to $275,000
Bonus opportunity up to 80% of base salary, based on performance
Comprehensive benefits including healthcare, retirement plans, generous time off, and hybrid working flexibility
This is a rare opportunity to drive platform strategy and hands-on development within a deeply technical and forward-thinking team. If you are passionate about Rust, AI systems, and scalable infrastructure, we would like to hear from you.
To learn more or apply, feel free to reach out directly or submit your resume for consideration.