Senior Machine Learning Engineer - Voice Model(ASR/STT) - AI Teams (x/f/m)

Paris, Paris, FranceCompetitiveHybrid0 applicants

About this role

What You will Do

We are looking for a Senior Machine Learning Engineer to join the Voice Model (ASR/STT) engineering team in AI & Clinical Products.

Responsibilities

  • We are looking for a Senior Machine Learning Engineer to join the Voice Model (ASR/STT) engineering team in AI & Clinical Products.
  • As a Senior Machine Learning Engineer, your mission will be to deliver the ASR backbone that powers our AI products, helping health professionals save time on documentation and focus more on patient care. You will be working in a feature team developing the speech recognition technology for Doctolib's AI-powered solutions including Consultation Assistant and Phone Assistant.
  • Working in the tech team at Doctolib involves building innovative products and features to improve the daily lives of care teams and patients. We work in feature teams in an agile environment, while collaborating with product, design, and business teams.
  • Your responsibilities include but are not limited to:
  • Deliver the ASR roadmap end-to-end: model design, training, evaluation, and product integration for medical-grade speech recognition
  • Partner with MLOps to ensure training and inference pipelines are scalable, cost-efficient, and reliable in production
  • Collaborate with product, design, and clinical teams to translate user needs into measurable technical objectives
  • Drive continuous improvements to WER, medical term error rate, latency, diarization, domain adaptation, and multilingual performance
  • About our tech environment
  • Our solutions are built on a single fully cloud-native platform that supports web and mobile app interfaces, multiple languages, and is adapted to the country and healthcare specialty requirements. To address these challenges, we are modularizing our platform run in a distributed architecture through reusable components.
  • Our stack is composed of Rails, TypeScript, Java, Python, Kotlin, Swift, and React Native.
  • We leverage AI ethically across our products to empower patients and health professionals. Discover our AI vision here and learn about our first AI hackathon here!
  • Who you are
  • Before you read on — if you don't have the exact profile described below, but you feel this job description matches your skill set, we still encourage you to apply.
  • You have a Master's or Ph.D. degree in Computer Science, Data Science, or a related field
  • You have at least 5 years of experience in ML with deep expertise in ASR/Speech-to-Text (end-to-end or hybrid), including streaming STT and real-time constraints
  • You have hands-on experience with modern speech stacks: CTC/Transducer/Attention, Conformer/Whisper-style models, tokenizer/LM integration, diarization, and voice activity detection
  • You have strong PyTorch skills and production ML experience: model serving, monitoring, A/B testing, rollback, and incident response in partnership with MLOps
  • You are fluent in English
  • Now it would be fantastic if you have:
  • Experience with multilingual ASR, on-device or low-latency inference, telephony audio, or medical domain adaptation
  • A passion for pushing the boundaries of speech recognition and AI in healthcare

EU Requirements

Job Details

Posted31 March 2026
Closes30 April 2026
Work ModeHybrid

Contact

Similar Jobs

Finding similar jobs...