speechbrain/speechbrain

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/speechbrain/speechbrain)

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

☆11,388

Alternatives and similar repositories for speechbrain

Users that are interested in speechbrain are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

espnet / espnet
View on GitHub
End-to-End Speech Processing Toolkit
☆9,792Updated this week
asteroid-team / asteroid
View on GitHub
The PyTorch-based audio source separation toolkit for researchers
☆2,556Oct 6, 2025Updated 6 months ago
pyannote / pyannote-audio
View on GitHub
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…
☆9,449Mar 30, 2026Updated last week
s3prl / s3prl
View on GitHub
Self-Supervised Speech Pre-training and Representation Learning Toolkit
☆2,542Mar 12, 2026Updated 3 weeks ago
wenet-e2e / wenet
View on GitHub
Production First and Production Ready End-to-End Speech Recognition Toolkit
☆5,065Mar 31, 2026Updated last week
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
NVIDIA-NeMo / NeMo
View on GitHub
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto…
☆17,048Updated this week
k2-fsa / k2
View on GitHub
FSA/FST algorithms, differentiable, with PyTorch compatibility.
☆1,320Mar 9, 2026Updated 3 weeks ago
lhotse-speech / lhotse
View on GitHub
Tools for handling multimodal data in machine learning projects.
☆1,122Updated this week
kaldi-asr / kaldi
View on GitHub
kaldi-asr/kaldi is the official location of the Kaldi project.
☆15,364Sep 22, 2025Updated 6 months ago
snakers4 / silero-vad
View on GitHub
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
☆8,643Mar 26, 2026Updated last week
facebookresearch / fairseq
View on GitHub
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆32,202Sep 30, 2025Updated 6 months ago
clovaai / voxceleb_trainer
View on GitHub
In defence of metric learning for speaker recognition
☆1,165Mar 26, 2024Updated 2 years ago
wenet-e2e / wespeaker
View on GitHub
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
☆1,251Mar 31, 2026Updated last week
mravanelli / pytorch-kaldi
View on GitHub
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…
☆2,399Mar 14, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
facebookresearch / denoiser
View on GitHub
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech E…
☆1,882Mar 14, 2023Updated 3 years ago
jik876 / hifi-gan
View on GitHub
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
☆2,338Jul 27, 2024Updated last year
aliutkus / speechmetrics
View on GitHub
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
☆1,047Jul 5, 2023Updated 2 years ago
facebookresearch / WavAugment
View on GitHub
A library for speech data augmentation in time-domain
☆684Aug 30, 2021Updated 4 years ago
wq2012 / awesome-diarization
View on GitHub
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
☆1,858Jul 22, 2025Updated 8 months ago
iver56 / audiomentations
View on GitHub
A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.
☆2,251Dec 27, 2025Updated 3 months ago
iver56 / torch-audiomentations
View on GitHub
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
☆1,141Nov 24, 2025Updated 4 months ago
SpeechColab / GigaSpeech
View on GitHub
Large, modern dataset for speech recognition
☆722Feb 26, 2024Updated 2 years ago
kan-bayashi / ParallelWaveGAN
View on GitHub
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
☆1,639Apr 22, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Snowdar / asv-subtools
View on GitHub
An Open Source Tools for Speaker Recognition
☆636Aug 5, 2024Updated last year
k2-fsa / icefall
View on GitHub
☆1,383Mar 25, 2026Updated last week
microsoft / DNS-Challenge
View on GitHub
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
☆1,398Jul 25, 2024Updated last year
WenzheLiu-Speech / awesome-speech-enhancement
View on GitHub
speech enhancement\speech seperation\sound source localization
☆1,231Nov 14, 2023Updated 2 years ago
pytorch / audio
View on GitHub
Data manipulation and transformation for audio signal processing, powered by PyTorch
☆2,857Updated this week
MontrealCorpusTools / Montreal-Forced-Aligner
View on GitHub
Command line utility for forced alignment using Kaldi
☆1,782Mar 31, 2026Updated last week
facebookresearch / encodec
View on GitHub
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
☆3,932Jan 4, 2024Updated 2 years ago
coqui-ai / open-speech-corpora
View on GitHub
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
☆1,391Jun 6, 2024Updated last year
coqui-ai / TTS
View on GitHub
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆44,993Aug 16, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
jim-schwoebel / voice_datasets
View on GitHub
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
☆2,143Jun 6, 2024Updated last year
resemble-ai / Resemblyzer
View on GitHub
A python package to analyze and compare voices with deep learning
☆3,241Oct 12, 2023Updated 2 years ago
NVIDIA / BigVGAN
View on GitHub
Official PyTorch implementation of BigVGAN (ICLR 2023)
☆1,200Sep 5, 2024Updated last year
microsoft / NeuralSpeech
View on GitHub
☆1,460Feb 11, 2024Updated 2 years ago
JusperLee / Speech-Separation-Paper-Tutorial
View on GitHub
A must-read paper for speech separation based on neural networks
☆919Aug 11, 2025Updated 7 months ago
PaddlePaddle / PaddleSpeech
View on GitHub
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text fronten…
☆12,582Updated this week
m-bain / whisperX
View on GitHub
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆21,096Updated this week