Role : Senior AI Engineer
Location: Detroit, Michigan
Key Responsibilities
• Design, build, and maintain cloud infrastructure and platform services on Azure supporting web and API-based applications.
• Implement and manage Infrastructure as Code (IaC) using Terraform to deliver repeatable and compliant environments.
• Support and optimize Azure Kubernetes Service clusters, including deployment, scaling, upgrades, and operational troubleshooting across multiple application environments (dev/test/prod).
• Partner with application teams to ensure secure, performant, and scalable deployments, providing guidance on best practices.
• Implement and enforce security, governance, and compliance controls across cloud environments.
• Support CI/CD pipelines, enabling automated builds, deployments, and environment provisioning in Azure DevOps.
• Manage and operate Argo CD.
• Monitor system health, performance, and cost, proactively identifying risks and optimization opportunities using Prometheus and Grafana.
• Lead and contribute to modernization efforts, including migrating legacy workloads to cloud-native architectures.
• Troubleshoot complex infrastructure and deployment issues across lower and higher environments.
• Participate in Agile ceremonies and collaborate effectively in a fast-paced, cross-functional team
Maintain documentation and promote operational best practices that support long-term platform reliability.
• Maintain and track certificates to ensure consistent website uptime and user experience.
• Along with other members of the team, provide support when production issues occur.
Required Skills & Experience
• Strong experience as a Cloud / Infrastructure Engineer, primarily in Microsoft Azure.
• Hands-on experience with Kubernetes (AKS) and containerized workloads.
• Strong understanding of networking, identity, access management, and cloud security principles.
• Experience implementing and maintaining CI/CD pipelines using Azure DevOps or similar tools.
• Proficiency with Git and branching strategies (e.g., GitFlow).
• Experience supporting Web APIs and application platforms from an infrastructure and operational perspective.
• Knowledge of performance, scalability, high availability, and disaster recovery design.
• Experience with automation, scripting, and operational tooling.
• Familiarity with Agile delivery models and DevSecOps/SRE practices.
• Ability to work independently, prioritize effectively, and collaborate across teams.
• Familiarity with React/.NET application stacks from a platform support perspective. Nice to Have
• Experience modernizing legacy applications to cloud-native architectures.
• Experience with policy management (Azure Policy) and governance frameworks.
• Exposure to monitoring and observability tools including Loki and OpenTelemetry.
• Strong communication skills to translate infrastructure concepts to non-infrastructure teams.
• Experience implementing AI thoughtfully across platform engineering and application teams