Get started

Effortlessly leverage audio data with speaker diarization

Discover models that give developers and product teams easy access to state-of-the-art speech AI wherever it matters.

Speed

Identify and differentiate speakers in the blink of an eye.

Accuracy

Our AI models reach the highest level of precision in the industry.

Deployment

Deploy our models anywhere, on premises or through our API.

STATE-OF-THE-ART

Explore our cutting-edge models

Open

Open source

Discover the best-known open-source diarization model.

Turbo

Optimized

Deploy a much faster version of the open-source model.

Precision

Optimized

Unlock the full potential of your audio with our most accurate AI model.

30M+
monthly downloads of models on Hugging Face
5k+
stars for open-source pyannote on GitHub
300k+ ⬇️
monthly package downloads on all platforms
Winner
DISPLACE 2024 Multilingual speaker diarization challenge
3rd
VoxSRC 2024 with 1st and 2nd also relying on pyannote
Winner
Ego4D 2022 with 2nd also relying on pyannote
Winner
Albayzin 2022 speaker diarization challenge

Enhanced diarization

World-class diarization is the foundation of our work, and we are always looking for ways to help you get more from our models and better match your particular needs. Enhance diarization with the following features.

Speaker diarization

Our world-class speaker diarization technology segments multi-speaker conversations into speech turns and assigns a unique tag per speaker. It really shines with noisy, dynamic, and overlapping speech.
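As a rough illustration of what "speech turns with a unique tag per speaker" can look like in practice, here is a minimal Python sketch. The `Turn` type, the `SPEAKER_00`-style tags, and the example timings are assumptions made for illustration, not the actual output format of our models.

```python
from collections import defaultdict
from typing import NamedTuple


class Turn(NamedTuple):
    """One speech turn (hypothetical representation)."""
    start: float  # seconds
    end: float    # seconds
    speaker: str  # anonymous tag, e.g. "SPEAKER_00"


def speaking_time(turns):
    """Return the total speaking time per speaker tag, in seconds."""
    totals = defaultdict(float)
    for turn in turns:
        totals[turn.speaker] += turn.end - turn.start
    return dict(totals)


# Hypothetical diarization result for a short two-speaker exchange.
# The first two turns overlap between 3.5 s and 4.0 s.
turns = [
    Turn(0.0, 4.0, "SPEAKER_00"),
    Turn(3.5, 7.5, "SPEAKER_01"),
    Turn(8.0, 10.0, "SPEAKER_00"),
]
totals = speaking_time(turns)
```

Once a conversation is reduced to turns like these, per-speaker statistics such as talk time become simple aggregations.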

Speaker identification

By combining our world-class speaker diarization technology with state-of-the-art voiceprints, speaker identification tracks recurring speakers across multiple podcast episodes or team meetings.
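One common way to match a speaker against enrolled voiceprints is to compare embedding vectors with cosine similarity. The sketch below assumes that approach; the three-dimensional vectors, the names, the `identify` helper, and the 0.7 threshold are all hypothetical and only illustrate the idea.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def identify(embedding, voiceprints, threshold=0.7):
    """Return the enrolled name most similar to `embedding`.

    Returns None when no voiceprint reaches `threshold` (an assumed value).
    """
    best_name, best_score = None, threshold
    for name, reference in voiceprints.items():
        score = cosine_similarity(embedding, reference)
        if score >= best_score:
            best_name, best_score = name, score
    return best_name


# Hypothetical enrolled voiceprints (real embeddings have many more dimensions).
voiceprints = {"alice": [1.0, 0.0, 0.0], "bob": [0.0, 1.0, 0.0]}

known = identify([0.9, 0.1, 0.0], voiceprints)    # close to alice's voiceprint
unknown = identify([0.0, 0.0, 1.0], voiceprints)  # matches nobody enrolled
```

Returning `None` for unmatched speakers lets new voices be kept as anonymous tags rather than forced onto the nearest known name.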

Overlapping speech detection

Overlapping speech happens all the time in spontaneous conversations (fast turn-taking, back-channeling, or interruption). Not only do we detect it, but we also attribute it to the right speakers.
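To make "detect it and attribute it to the right speakers" concrete, the following sketch derives overlap regions from a list of speech turns. It assumes turns are available as `(start, end, speaker)` tuples in seconds, which is an illustrative format rather than the actual output of our models.

```python
def overlapping_regions(turns):
    """Return (start, end, speakers) for regions where two or more speakers talk.

    `turns` is a list of (start, end, speaker) tuples, in seconds.
    """
    # Build boundary events. At equal times, ends (0) sort before starts (1),
    # so turns that merely touch are not reported as overlapping.
    events = []
    for start, end, speaker in turns:
        if end <= start:
            continue  # ignore empty or malformed turns
        events.append((start, 1, speaker))
        events.append((end, 0, speaker))
    events.sort()

    regions = []
    active = {}  # speaker -> number of currently open turns
    previous_time = None
    for time, is_start, speaker in events:
        # Emit the span since the last boundary if 2+ speakers were active.
        if previous_time is not None and time > previous_time and len(active) >= 2:
            regions.append((previous_time, time, sorted(active)))
        if is_start:
            active[speaker] = active.get(speaker, 0) + 1
        else:
            active[speaker] -= 1
            if active[speaker] == 0:
                del active[speaker]
        previous_time = time
    return regions


# Hypothetical turns: A is interrupted by B half a second before finishing.
example_turns = [(0.0, 4.0, "A"), (3.5, 7.5, "B"), (8.0, 10.0, "A")]
overlaps = overlapping_regions(example_turns)
```

Each returned region carries the sorted list of speakers active during it, so overlapped speech stays attributed to everyone who was talking.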

Change point detection

Our world-class speaker diarization technology provides extremely accurate speaker change timestamps. It prevents audio from being cut in the middle of a word and improves speaker attribution.

Voice activity detection

Our world-class speaker diarization technology builds on top of state-of-the-art voice activity detection. It removes regions without speech, yet detects even the shortest words ("yes", "oh", "hmm").

Speaker separation (soon)

Our world-class speaker diarization technology not only detects overlapping speech but can also isolate overlapping speakers and return one separate audio stream per speaker.

Confidence score (soon)

Our world-class speaker diarization technology highlights the most complex parts of a conversation automatically. Use confidence scores to filter out the noisiest parts of your training data, or for human-in-the-loop use cases.
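Since confidence scores are not yet released, their exact format is unknown. The sketch below simply assumes each segment carries a `confidence` value between 0 and 1 and shows how such scores could split data into a clean training set and a human review queue. The field names and the 0.8 threshold are hypothetical.

```python
def split_by_confidence(segments, threshold=0.8):
    """Split segments into (keep, review) lists by confidence.

    Segments at or above `threshold` are kept; the rest are set aside,
    for example for human-in-the-loop review.
    """
    keep, review = [], []
    for segment in segments:
        if segment["confidence"] >= threshold:
            keep.append(segment)
        else:
            review.append(segment)
    return keep, review


# Hypothetical segments with made-up confidence values.
segments = [
    {"start": 0.0, "end": 2.0, "speaker": "SPEAKER_00", "confidence": 0.95},
    {"start": 2.0, "end": 2.4, "speaker": "SPEAKER_01", "confidence": 0.41},
    {"start": 2.4, "end": 5.0, "speaker": "SPEAKER_00", "confidence": 0.80},
]
keep, review = split_by_confidence(segments)
```

The right threshold depends on the use case: a stricter value yields cleaner training data at the cost of sending more segments to review.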
