Speaker Intelligence Platform that decodes voice interaction

Deliver accurate speaker metadata to every layer of the Voice AI stack, so it doesn’t fail in production

200K+ developers

Billions of audio hours processed

Drop-in API integration

Understand real-world conversation

Our Platform

Transform raw audio into structured, reusable building blocks

pyannoteAI provides a speaker intelligence layer that makes voice AI systems reliable in production.

Our background

Research-backed, production-ready AI models

pyannoteAI is built on 12 years of speech processing and speaker diarization research. Our models have been open source since 2016, and have surpassed 1 billion downloads on Hugging Face.

# download pretrained pipeline from Huggingface
from pyannote.audio import Pipeline
pipeline = Pipeline.from_pretrained('pyannote/speaker-diarization-community-1', token="HUGGINGFACE_TOKEN")

# perform speaker diarization locally
output = pipeline('/path/to/audio.wav')

# enjoy state-of-the-art speaker diarization
for turn, speaker in output.speaker_diarization:
    print(f"{speaker} speaks between t={turn.start}s and t={turn.end}s")

Open source roots

Community-1

Open-source, community-supported model. Widely adopted for research and development.

State-of-the-art

Precision-2

Higher accuracy, advanced controls, and enterprise-grade tooling for production teams.

11.5K

GitHub Stars

1B+

Hugging Face Downloads

225K

users worldwide

Built for production workloads

pyannoteAI's architecture meets the security, compliance, and operational requirements of enterprise teams.

API

On-Premise

On-Device (by Argmax)

Speaker Intelligence features

Speaker and conversation insights through a single API. Built on the world's most accurate speaker diarization models.

Speaker Diarization

Identifies and labels each speaker in multi-participant audio.

Overlapping Speech Detection

Detects and attributes overlapping speech to the correct speakers.

STT Orchestration

Combines diarization and transcription for speaker-attributed transcripts.

Voiceprint

Identifies specific individuals across sessions using voice biometric signatures.

Confidence Scores

Identifies complex segments for quality control.

Speaker Separation (Soon)

Isolates overlapping speakers into individual audio streams.
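
The STT Orchestration feature above pairs diarization with transcription. As a rough illustration of the idea, and not pyannoteAI's actual implementation, the sketch below labels each transcribed word with the speaker whose turn overlaps it the most. The `Turn` and `Word` types, the helper names, and the timestamps are all hypothetical and made up for this example.

```python
from dataclasses import dataclass


@dataclass
class Turn:
    """A diarization turn (hypothetical type for this sketch)."""
    start: float
    end: float
    speaker: str


@dataclass
class Word:
    """A transcribed word with timestamps (hypothetical type for this sketch)."""
    start: float
    end: float
    text: str


def overlap(a_start, a_end, b_start, b_end):
    """Length in seconds of the intersection of two intervals (0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def attribute_words(turns, words):
    """Label each word with the speaker whose turn overlaps it the most."""
    labeled = []
    for word in words:
        best_speaker, best_overlap = "UNKNOWN", 0.0
        for turn in turns:
            shared = overlap(turn.start, turn.end, word.start, word.end)
            if shared > best_overlap:
                best_speaker, best_overlap = turn.speaker, shared
        labeled.append((best_speaker, word.text))
    return labeled


def group_by_speaker(labeled):
    """Merge consecutive words from the same speaker into one transcript line."""
    lines = []
    for speaker, text in labeled:
        if lines and lines[-1][0] == speaker:
            lines[-1][1].append(text)
        else:
            lines.append((speaker, [text]))
    return [f"{speaker}: {' '.join(texts)}" for speaker, texts in lines]


# Illustrative data only; real turns and words would come from a diarization
# pipeline and a speech-to-text system respectively.
turns = [Turn(0.0, 2.0, "SPEAKER_00"), Turn(1.8, 4.0, "SPEAKER_01")]
words = [
    Word(0.1, 0.5, "Hello"),
    Word(0.6, 1.0, "there"),
    Word(2.0, 2.4, "Hi"),
    Word(2.5, 3.0, "back"),
]

transcript = group_by_speaker(attribute_words(turns, words))
for line in transcript:
    print(line)
```

Maximum-overlap assignment is one simple choice. A production system would also need a policy for words that fall inside overlapping speech or outside every turn, which this sketch only marks as UNKNOWN.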

Built for developers

Integrate state-of-the-art speaker intelligence into your stack in minutes.

Compatible with any tech stack. No AI expertise required, no setup costs. Just call the API and build.

curl

curl --request POST \
  --url https://api.pyannote.ai/v1/diarize \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '{
    "url": "https://files.pyannote.ai/marklex1min.wav"
  }'
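
The same request can be prepared from Python using only the standard library. This is a minimal sketch that mirrors the curl call above: the endpoint and audio URL come from that example, while the `build_diarize_request` helper name and the explicit `Content-Type` header are assumptions made here. The shape of the response is not shown on this page, so the sketch stops at building the request.

```python
import json
import urllib.request

API_URL = "https://api.pyannote.ai/v1/diarize"  # endpoint from the curl example


def build_diarize_request(token: str, audio_url: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for the diarization endpoint.

    Hypothetical helper for illustration; not part of any pyannoteAI SDK.
    """
    payload = json.dumps({"url": audio_url}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=payload,
        headers={
            "Authorization": f"Bearer {token}",
            # Assumption: the JSON body is declared explicitly.
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_diarize_request(
    "<token>", "https://files.pyannote.ai/marklex1min.wav"
)

# Sending is left commented out so the sketch runs without credentials:
# with urllib.request.urlopen(request) as response:
#     print(json.load(response))
```

Replace `<token>` with a real API key before uncommenting the send step.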

Resources to get you started

Discover the power of speaker diarization

From voice to programmable intelligence

Decompose real-world voice interactions into structured metadata.