About this role

Dataiku is the Platform for AI Success, the enterprise orchestration layer for building, deploying, and governing AI. In a single environment, teams design and operate analytics, machine learning, and AI agents with the transparency, collaboration, and control enterprises require. Sitting above data platforms, cloud infrastructure, and AI services, Dataiku connects the full enterprise AI stack — empowering organizations to run AI across multi-vendor environments with centralized governance.

The world’s leading companies rely on Dataiku to operationalize AI and run it as a true business performance engine delivering measurable value. For more, visit the Dataiku blog, LinkedIn, X, and YouTube.

How you’ll make an impact

At Dataiku, our mission is to enable customers to bring large-scale data analytics and AI technologies into a centralized, easy-to-use platform. To support this mission, we are looking for an Infrastructure Engineer to help operate, maintain, and troubleshoot our internal and customer-facing infrastructure.

You will work closely with experienced infrastructure and platform engineers, contributing to the reliability and day-to-day operations of our systems. This role is hands-on and operationally focused, with a strong emphasis on UNIX/Linux systems and cloud infrastructure.

Our infrastructure runs primarily on AWS, with some components on Azure and GCP. Our tooling includes Terraform, Ansible, Kubernetes, and Python; you do not need deep expertise in all of these to start.

What you’ll work on

Operate, maintain, and troubleshoot UNIX/Linux systems running in cloud environments

Support and maintain existing configuration management and Infrastructure as Code setups

Assist with the operation of cloud-based infrastructure, including virtual machines, networking components, and managed services

Help monitor system health and performance, investigate alerts, and participate in incident response and root cause analysis

Perform routine infrastructure updates and maintenance to ensure systems remain secure, reliable, and up to date

Support Kubernetes clusters and containerized workloads, primarily from an operational and troubleshooting perspective

Collaborate with senior engineers to improve automation, monitoring, and operational practices

Document procedures, operational runbooks, and troubleshooting steps to improve team efficiency

What you need to be successful

Experience working with UNIX/Linux systems, including hands-on troubleshooting and shell scripting

Understanding of networking fundamentals (TCP/IP, DNS, routing, firewalls, load balancing) in cloud or data-center environments

Basic experience operating infrastructure in a cloud environment (preferably AWS), including compute, networking, and monitoring services

Basic scripting or development experience (e.g., Python)

Clear communication skills and a collaborative, respectful approach to working with teammates

Willingness to learn, ask questions, and deepen your technical skills over time

Responsibilities