speaker-diarization

Here are 352 public repositories matching this topic...

modelscope / FunASR

Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.

Updated Jun 8, 2026
Python

speechbrain / speechbrain

Star

A PyTorch-based Speech Toolkit

Updated May 27, 2026
Python

pyannote / pyannote-audio

Star

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pytorch pretrained-models speaker-recognition speaker-verification speech-processing speaker-diarization voice-activity-detection speech-activity-detection speaker-change-detection speaker-embedding overlapped-speech-detection

Updated Jun 6, 2026
Jupyter Notebook

espnet / espnet

Star

End-to-End Speech Processing Toolkit

text-to-speech deep-learning chainer end-to-end machine-translation pytorch speech-synthesis speech-recognition kaldi voice-conversion speaker-diarization speech-separation speech-enhancement spoken-language-understanding speech-translation singing-voice-synthesis

Updated Jun 8, 2026
Python

argmaxinc / argmax-oss-swift

Star

On-device Speech AI for Apple Silicon

macos swift ios text-to-speech transformers inference speech-recognition speech-to-text whisper speaker-diarization pyannote ttskit whisperkit qwen3-tts speakerkit

Updated Jun 2, 2026
Swift

MahmoudAshraf97 / whisper-diarization

Star

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

speech speech-recognition speech-to-text whisper asr speaker-diarization

Updated Feb 23, 2026
Jupyter Notebook

Purfview / whisper-standalone-win

Star

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

subtitles speech-recognition openai speech-to-text whisper asr speaker-diarization uvr transcriber diarization faster-whisper ctranslate2 whisperx whisper-faster vocal-extractor

Updated Nov 7, 2025

modelscope / 3D-Speaker

Star

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

speaker-verification speaker-diarization language-identification voxceleb modelscope campplus eres2net 3d-speaker cnceleb sdpn

Updated Dec 8, 2025
Python

linto-ai / whisper-timestamped

Star

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Updated Sep 9, 2025
Python

FluidInference / FluidAudio

Star

Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity detection, and speaker diarization. In Swift, powered by SOTA open source.

audio macos swift ios real-time avfoundation nvidia vad automatic-speech-recognition speech-to-text ane speaker-recognition asr speaker-diarization voice-activity-detection coreml speaker-identification speaker-embedding parakeet

Updated Jun 8, 2026
Swift

juanmc2005 / diart

Star

A python package to build AI-powered real-time audio applications

real-time deep-learning transcription speaker-diarization streaming-audio voice-activity-detection speaker-embedding

Updated Feb 12, 2025
Python

wq2012 / awesome-diarization

Star

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

machine-learning awesome deep-learning speech-recognition awesome-list speech-processing speaker-diarization

Updated Jun 1, 2026

google / uis-rnn

Star

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

machine-learning clustering supervised-learning speaker-recognition speaker-diarization supervised-clustering uis-rnn

Updated Sep 25, 2024
Python

wenet-e2e / wespeaker

Star

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Updated Apr 10, 2026
Python

FunAudioLLM / Fun-ASR

Star

End-to-end speech recognition large model: 31 languages, dialects, accents, lyrics, hotwords, timestamps, speaker diarization. Trained on tens of millions of hours.

pytorch speech-recognition speech-to-text transcription asr speaker-diarization chinese-dialects real-time-asr audio-language-model multilingual-asr fun-asr whisper-alternative 31-languages llm-asr

Updated Jun 8, 2026
Python

transcriptionstream / transcriptionstream

Star

turnkey self-hosted offline transcription and diarization service with llm summary

automation speech-recognition transcription whisper speaker-diarization diarization llm whisperx ollama mistral-7b

Updated Jan 18, 2026
Python

soniqo / speech-swift

Star

AI speech toolkit for Apple Silicon — ASR, TTS, speech-to-speech, VAD, and diarization powered by MLX and CoreML

macos swift ios text-to-speech tts speech-recognition asr mlx speaker-diarization voice-activity-detection coreml speech-enhancement on-device neural-engine speech-to-speech apple-silicon

Updated Jun 4, 2026
Swift

corvo007 / MioSub

Star

一站式全自动字幕生成软件，下载、转录、翻译、压制全流程覆盖，无需人工介入 / One-stop automated subtitle generator. Handles downloading, transcription, translation, and hardcoding—zero human intervention required.

i18n ffmpeg captions subtitles alignment substation-alpha speech-to-text transcription whisper srt-subtitles speaker-diarization ass-subtitles forced-alignment gemini-api subtitle-generator subtitle-translation diarization subtitles-generator gemini-subtitle-pro

Updated Jun 5, 2026
TypeScript

yinruiqing / pyannote-whisper

Star

whisper asr speaker-diarization meeting-summarization pyannote chatgpt

Updated Sep 24, 2025
Python

wq2012 / SpectralCluster

Star

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

python machine-learning clustering unsupervised-learning constrained-clustering speaker-diarization spectral-clustering unsupervised-clustering auto-tune

Updated Sep 25, 2024
Python

Improve this page

Add a description, image, and links to the speaker-diarization topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speaker-diarization topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speaker-diarization

Here are 352 public repositories matching this topic...

modelscope / FunASR

speechbrain / speechbrain

pyannote / pyannote-audio

espnet / espnet

argmaxinc / argmax-oss-swift

MahmoudAshraf97 / whisper-diarization

Purfview / whisper-standalone-win

modelscope / 3D-Speaker

linto-ai / whisper-timestamped

FluidInference / FluidAudio

juanmc2005 / diart

wq2012 / awesome-diarization

google / uis-rnn

wenet-e2e / wespeaker

FunAudioLLM / Fun-ASR

transcriptionstream / transcriptionstream

soniqo / speech-swift

corvo007 / MioSub

yinruiqing / pyannote-whisper

wq2012 / SpectralCluster

Improve this page

Add this topic to your repo