Data Scientist R&D / Speech-focused (CDD) @ ToumAI

ToumAI

Data Scientist R&D / Speech-focused (CDD)

Témara.Publiée il y a 6 jours

IA conversationnelle

Experience Client

Type de contrat: CDD

Lieu de travail: Hybride

Expérience: 3 - 5 ans

Langues: Anglais

Niveau d'étude: BAC +5

L'ENTREPRISE

Dans un monde de plus en plus vocal, global et multiculturel, la capacité à se comprendre au-delà des langues, des accents et des contextes culturels est devenue un enjeu majeur. Chez ToumAI, notre mission est de rendre chaque voix compréhensible et utile, quelles que soient la langue, la culture ou le niveau d’aisance numérique. Nous développons des solutions d’intelligence artificielle vocale multilingue qui rendent la conversation fluide et accessible, en permettant aux interactions de se dérouler naturellement dans plusieurs langues, avec une prise en compte fine des émotions et des nuances culturelles. En analysant les conversations au-delà des mots, nous aidons les organisations à mieux comprendre les intentions, les ressentis et les attentes de leurs clients, tout en transformant ces échanges en insights exploitables pour améliorer la qualité de service et l’expérience globale. Entreprise de développement technologique spécialisée en IA vocale, ToumAI accompagne des acteurs de secteurs variés — finance, télécommunications, commerce, communication, immobilier et services — avec une présence à la

Inscrivez-vous et postulez en un clic

Créez votre profil, découvrez des opportunités adaptées à vos compétences et postulez facilement !

About the position

Introduction to the position

As a Data Scientist – R&D (Speech-focused) at ToumAI, your mission will be to research, develop and optimize speech and voice intelligence models that power our Voice AI platforms, SDKs and on-device solutions. You will work on end-to-end speech pipelines, from data and annotation strategies to model training, evaluation and optimization, with strong constraints on accuracy, latency, robustness and footprint, particularly for low-resource languages and dialects such as Moroccan Darija. Your work will directly impact production systems used in voicebots, QMS, VoC analytics, APIs and on-device SDKs, bridging applied research and real-world deployment.

Your role

• Research, design and train speech-related models (STT components, language identification, diarization, emotion recognition, speech segmentation, code-switching). • Improve transcription accuracy and robustness in noisy, conversational and real-world audio conditions. • Work on multilingual and dialectal speech data with a focus on low-resource settings. • Define and refine data collection, annotation strategies and quality control processes for speech datasets. • Design evaluation protocols and metrics aligned with product and business requirements. • Collaborate with platform and ML engineers to integrate models into real-time and batch pipelines. • Optimize models for latency, memory footprint and inference efficiency (quantization, pruning, distillation). • Analyze model errors and failure modes, propose corrective strategies and iterate. • Document experiments, architectures and learnings to support long-term R&D. • Stay up to date with state-of-the-art research in speech processing and applied Voice AI.

Your qualifications

• Strong curiosity for speech processing and applied AI research. • Solid experience in machine learning and data science, with a focus on speech or audio-based models. • Hands-on experience with PyTorch (or equivalent deep learning frameworks). • Understanding of speech tasks such as ASR/STT, diarization, VAD, language identification or emotion recognition. • Experience working with audio data pipelines and annotation workflows. • Familiarity with model evaluation, error analysis and benchmarking in speech systems. • Interest in low-latency and on-device inference constraints is a strong plus. • Experience with multilingual, low-resource or dialectal speech is highly valued. • Familiarity with LLM-based speech pipelines or speech-to-text post-processing is a plus. • Strong analytical mindset and ability to collaborate with engineering teams. • Proficiency in French or Arabic is a plus; English required for technical work.

Benefits

At ToumAI, you will work at the intersection of applied research and production Voice AI. You will: • Contribute to core R&D on speech and voice intelligence • Work on real datasets and deployed systems • Influence model choices, architectures and optimization strategies • Help shape the future of Voice AI for low-resource languages If you enjoy pushing speech models from research to real-world deployment under tight constraints, this role is for you.

Recruitment process

First conversation with the HR team to get to know you better and introduce you to ToumAI
A technical test or applied research case (speech-focused)
Role-specific interview with your future manager / R&D lead
Final meeting with top management (if needed)

L'ENTREPRISE

Inscrivez-vous et postulez en un clic

Créez votre profil, découvrez des opportunités adaptées à vos compétences et postulez facilement !