Dataiku is the Platform for AI Success, the enterprise orchestration layer for building, deploying, and governing AI. In a single environment, teams design and operate analytics, machine learning, and AI agents with the transparency, collaboration, and control enterprises require. Sitting above data platforms, cloud infrastructure, and AI services, Dataiku connects the full enterprise AI stack — empowering organizations to run AI across multi-vendor environments with centralized governance.
The world’s leading companies rely on Dataiku to operationalize AI and run it as a true business performance engine delivering measurable value. For more, visit the Dataiku blog, LinkedIn, X, and YouTube.
How you’ll make an impact
At Dataiku, our mission is to enable customers to bring large-scale data analytics and AI technologies into a centralized, easy-to-use platform. To support this mission, we are looking for an Infrastructure Engineer to help operate, maintain, and troubleshoot our internal and customer-facing infrastructure.
You will work closely with experienced infrastructure and platform engineers, contributing to the reliability and day-to-day operations of our systems. This role is hands-on and operationally focused, with a strong emphasis on UNIX/Linux systems and cloud infrastructure.
Our infrastructure runs primarily on AWS, with some components on Azure and GCP. The tooling environment includes Terraform, Ansible, Kubernetes, and Python, though you do not need deep expertise in all of these to start.
What you’ll work on
Operate, maintain, and troubleshoot UNIX/Linux systems running in cloud environments
Support and maintain existing configuration management and Infrastructure as Code setups
Assist with the operation of cloud-based infrastructure, including virtual machines, networking components, and managed services
Help monitor system health and performance, investigate alerts, and participate in incident response and root cause analysis
Perform routine infrastructure updates and maintenance to ensure systems remain secure, reliable, and up to date
Support Kubernetes clusters and containerized workloads, primarily from an operational and troubleshooting perspective
Collaborate with senior engineers to improve automation, monitoring, and operational practices
Document procedures, operational runbooks, and troubleshooting steps to improve team efficiency
What you need to be successful
Experience working with UNIX/Linux systems, including hands-on troubleshooting and shell scripting
Understanding of networking fundamentals (TCP/IP, DNS, routing, firewalls, load balancing) in cloud or data-center environments
Basic experience operating infrastructure in a cloud environment (preferably AWS), including compute, networking, and monitoring services
Basic scripting or development experience (e.g., Python)
Clear communication skills and a collaborative, respectful approach to working with teammates
Willingness to learn, ask questions, and build technical depth over time