Data Science & AI: Data Scientist / AI Engineer
McBride
Client: Allied Command Transformation (ACT)
is NATO’s leading agent for change: driving, facilitating, and advocating the continuous improvement of Alliance capabilities to maintain and enhance the military relevance and effectiveness of the Alliance. The main objectives of ACT
are:
providing appropriate support to NATO missions and operations; leading NATO military transformation; improving relationships, interaction and practical cooperation with partners, nations and international organizations. ACT therefore leads Alliance concept development, capability development, training and lessons learned initiatives and provides unfettered military support to policy development within NATO.
Tasking:
Contribute to the development and implementation of an enabling data science and AI capability at HQ SACT and across the NATO Enterprise, with a specific focus on scalable data engineering and software systems to support AI initiatives.
Design, develop, and
maintain
robust data pipelines and architectures to manage the ingestion, transformation, and processing of structured and unstructured data for large Language Model (LLM)-based applications and other AI systems.
Lead efforts to
optimize
data delivery and automate data engineering processes, proposing enhancements to infrastructure to improve scalability, efficiency, and reliability in support of LLM deployments.
Build API-based infrastructure and frameworks that enable seamless integration of LLMs and ML models with operational systems, ensuring performance, security, and interoperability with NATO environments.
Support the development, testing, and validation of microservices and containerized applications to operationalize AI/ML capabilities, including deployment of LLM use cases within NATO.
Implement distributed data storage and processing systems (e.g., cloud-based or hybrid architectures) that align with NATO standards and enable scalable use of LLMs across the enterprise.
Develop tools and systems to improve data accessibility, enabling data scientists and analysts to efficiently interact with and query data for training, inference, and analytics.
Coordinate with data scientists, software engineers, and system architects to align data engineering workflows with broader AI/ML
objectives
, ensuring
timely
delivery of clean, high-quality data for LLM training and inference.
Establish mechanisms for real-time data processing and streaming, enabling LLMs to
operate
effectively in dynamic and responsive applications, such as operational decision support or strategic analysis.
Conduct preprocessing, cleansing, and transformation of raw data into formats
optimized
for training, fine-tuning, and inference within LLM infrastructure.
Implement robust monitoring, logging, and performance optimization tools for data pipelines and APIs, ensuring reliability and traceability of LLM-enabled workflows.
Collaborate with teams to support federated learning approaches and cross-domain data sharing, ensuring compliance with NATO data sovereignty, security, and ethical guidelines.
Provide subject matter expertise on data engineering and software development to (military and civilian) staff within HQ SACT or the
NATO Enterprise, and
develop proofs of concept for LLM-based applications as directed.
Research, recommend, and implement best practices for deploying LLMs in secure, cloud-based environments such as Microsoft Azure or AWS, while considering NATO- specific data policies and standards.
Evaluate operational requirements and objectives, recommending appropriate engineering solutions for integrating LLMs into NATO workflows and systems.
Stay abreast of new developments in AI engineering, including innovations in LLM technologies, data architectures, distributed computing, and API development, to bring cutting-edge capabilities into implementation within NATO.
Provide technical training and mentoring to NATO staff, supporting educational efforts in AI engineering, data pipeline design, API development, and digital literacy.
Foster a culture of innovation and data-driven decision-making across NATO by building scalable systems that enable the effective exploitation of LLMs and advanced analytics.
Perform additional tasks as required by the Contracting Officer’s Technical Representative (COTR) related to the LABOR category.
Desir
ed
Qualifications:
Experience
leveraging
open-source frameworks and publicly available datasets to develop innovative AI and data engineering solutions for operational or analytical use cases.
Proficiency
in presenting data-driven insights clearly to non-technical audiences,
showcasing
an ability to craft compelling narratives and actionable recommendations for senior leadership.
Understanding of military staff workflows and processes, alongside familiarity with federated learning techniques for enabling secure collaboration across NATO nations while preserving
sovereignty of sensitive datasets.
Exposure to agile project management methods and tools (e.g., Loop, JIRA, Trello) for
coordinating and tracking progress across multi-disciplinary AI/ML projects.
Exposure to cross-domain data sharing and API-driven interoperability, ensuring effective
integration across systems while adhering to security and ethical guidelines within military or international environments.
Familiarity with principles of ethical AI development, including considerations for bias
mitigation, responsible data handling, and alignment with NATO’s ethical frameworks for AI deployment.
Requirements
Mandatory
Qualifications:
Minimum 4 years of proven work experience as a Data Scientist, Machine Learning Engineer, Data Engineer, or Software Engineer, with a strong emphasis on distributed systems, cloud- based architectures, developing operational AI/ML solutions, and designing API-based infrastructures, microservices architectures, and containerized applications (e.g., Docker, Kubernetes).
Demonstrated experience working with
GenAI
,
in particular LLMs
, including preprocessing data, fine-tuning, and deployment in secure and scalable environments to include AI/ML frameworks such as TensorFlow,
PyTorch
, or scikit-learn.
Proven
expertise
in programming languages such as Python, Java, or Scala, with
demonstrated
experience in software engineering practices (e.g., version control, CI/CD pipelines, containerization).
Experience building and
optimizing
data pipelines, ETL processes, and real-time streaming solutions using tools like Apache Airflow, Kafka, Spark, or equivalent.
Knowledge of applied AI principles, particularly in implementing AI systems for operational decision support and analyzing unstructured data (e.g., text, imagery).
Ability to architect and
maintain
scalable data lakes, data warehouses, or distributed storage systems (e.g., Delta Lake, Snowflake, Hadoop, or NoSQL solutions).
Demonstrated understanding of data security, privacy, and sovereignty issues, particularly in military or international environments, ensuring compliance with NATO operational and ethical standards.
Experience building visually impactful reports, dashboards, and analytics using tools such as Tableau, MS Power BI, or Kibana, supporting informed decision-making for high-level stakeholders.
Professional experience in NATO environments or familiarity with NATO processes, organizational culture, and decision-making structures.
Ability to translate operational problems into practical AI/ML solutions tailored for military and civilian teams.
Proven ability to collaborate effectively within multidisciplinary teams, including coordinating with data scientists, software engineers, and system architects on cross-functional projects.
Strong oral and written communication skills, with the ability to brief non-technical audiences and mentor staff in AI engineering, data science, and software development concepts.
style="margin:0px; padding:0px; font-size:11pt; line-height:19.425px; font-family:Calibri, Calibri_EmbeddedFont, Calib