Data Full Stack AI Engineer
This role is pivotal in transforming complex system-level data into high-quality, AI-ready data products. The ideal candidate will bridge the gap between backend data orchestration and frontend application development within a robust Google Cloud Platform (GCP) environment.
Core Responsibilities
● Data Translation & Modeling: Translate complex system-level data into
business-friendly entities and semantic models. Design models that yield
datasets ready for Generative AI and Agentic AI, as well as
presentation-ready data products, within GCP.
● Full Stack & Orchestration: Build and maintain the orchestration, serving,
and API integration layers. Leverage Cloud Dataflow for stream and batch
processing, and Pub/Sub for real-time messaging to support autonomous
agent workflows.
● CI/CD & Infrastructure as Code: Implement and manage robust CI/CD
pipelines using GitHub Actions. Automate infrastructure provisioning using
Terraform and manage GitHub branching strategies (Dev, Stage, Prod).
● Data Lifecycle Management: Perform discovery, preparation, and
integration of datasets for MVP and production use. Align requirements
with existing enterprise data landing platforms, covering both structured
and unstructured data formats.
● AI & Agentic Integration: Work closely with Vertex AI to deploy and
manage machine learning models. Design data structures that support
agentic reasoning, tool calling, and long-term memory.
● Governance & Documentation: Develop and maintain data dictionaries and
metadata documentation to ensure observability and high data quality for
autonomous system grounding.
● Strategic Collaboration: Participate in discovery sessions and prototyping
phases, mapping end-to-end customer journeys to specific data
requirements in collaboration with cross-functional teams.
Technical Skills & Qualifications
● Development: Proficiency in Full Stack development with a strong
emphasis on software engineering components (orchestration and serving
layers).
● Data Engineering: Strong experience in data pipeline development using
SQL, Python, Cloud Run, Cloud Functions, and Cloud Composer.
● Data Governance Framework: Deep technical knowledge of Data
Governance Frameworks, Data Quality, Metadata Management, and Data
Security.
● AI & Agentic AI: Experience preparing datasets for Vertex AI and building
data foundations for Agentic AI frameworks (reasoning, planning, and
execution).
● Automation & IaC: Expert-level experience with GitHub Actions for CI/CD
and Terraform for infrastructure management.
● Modeling: Expertise in dimensional data modeling, semantic layers, and
metadata management.
● Application Integration: Expertise in integrating with, and serving to,
3rd-party applications and web portals.