We are looking for an Incident Management Team Leader to join the Doctolib Operations Center team in Production Engineering.
As an Incident Manager Team Leader, your mission will be to improve the daily lives of care teams and patients by ensuring the reliability and operational excellence of our platform. You will manage and grow a team responsible for incident and problem management, elevate our operational standards, and partner with engineering and business teams to continuously improve reliability, change safety and observability.
Working in the tech team at Doctolib involves building innovative products and features to improve the daily lives of care teams and patients. We work in feature teams in an agile environment, while collaborating with product, design, and business teams.
Your responsibilities include but are not limited to:
Lead, coach, and grow a team of Incident Managers, supporting their technical development and career progression
Own the incident management program at scale, ensuring rapid response and resolution
Partner with Tech and product organization to drive reliability improvements
Lead change governance and production risk management
Build structure in ambiguous environments and establish best practices
About our tech environment
Our solutions are built on a single fully cloud-native platform that supports web and mobile app interfaces, multiple languages, and is adapted to the country and healthcare specialty requirements. To address these challenges, we are modularizing our platform run in a distributed architecture through reusable components.
Our stack is composed of Rails, TypeScript, Java, Python, Kotlin, Swift, and React Native.
We leverage AI ethically across our products to empower patients and health professionals. Discover our AI vision here and learn about our first AI hackathon here!
Who you are
Before you read on — if you don't have the exact profile described below, but you feel this job description matches your skill set, we still encourage you to apply.
You have experience managing and developing engineers, including hiring and performance management
You have a tech background and strong SRE mindset
You have strong organizational skills and can build structure in ambiguous environments
You are comfortable communicating across several levels of the organization, including with senior leadership
You have strong knowledge of Linux-based systems (CLI, bash) and cloud-based systems (AWS preferred)
Now it would be fantastic if you:
Have experience running major incident programs in a complex distributed environment
Have knowledge of a high-level language like Python or Ruby
Have working knowledge of the Kubernetes ecosystem