bbc/bbc-speech-segmenter

A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.

☆31

Alternatives and similar repositories for bbc-speech-segmenter

Users who are interested in bbc-speech-segmenter are comparing it to the repositories listed below.

Akella17 / speaker-embedding
A deep neural network for finding text-independent speaker embeddings, written in TensorFlow and Tensorpack
☆10 · Feb 19, 2018 · Updated 8 years ago
lucasjinreal / aural
A tiny project for ASR model training and deployment
☆26 · Oct 14, 2022 · Updated 3 years ago
tiro-is / tiro-speech-core
This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core
☆15 · Jun 19, 2023 · Updated 3 years ago
ICASSP2021-tutorial9 / Distant_conversational_ASR_and_analysis
☆12 · Jun 10, 2021 · Updated 5 years ago
talhanai / kaldi-diar-latte
Steps to perform text-based speaker diarization with the Kaldi toolkit
☆12 · Nov 2, 2018 · Updated 7 years ago
tuanio / nextformer
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
☆10 · Dec 15, 2022 · Updated 3 years ago
qianhwan / KaldiBasedSpeakerVerification
Kaldi-based speaker verification
☆47 · Jan 26, 2018 · Updated 8 years ago
Skycoder42 / QtUtils
A collection of various Qt-Classes, branch-sorted
☆12 · Apr 8, 2017 · Updated 9 years ago
markusdr / transducersaurus
Automatically exported from code.google.com/p/transducersaurus
☆11 · Apr 1, 2015 · Updated 11 years ago
MiniXC / LightningFastSpeech2
☆55 · Jan 13, 2023 · Updated 3 years ago
cvqluu / MTL-Speaker-Embeddings
Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…
☆26 · Oct 5, 2022 · Updated 3 years ago
PranavPutsa1006 / Speaker-Diarization
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python
☆18 · Jun 18, 2023 · Updated 3 years ago
seungheondoh / hi_kia
Wake-up word emotion recognition [APSIPA 2022]
☆17 · Nov 11, 2022 · Updated 3 years ago
hhj1897 / roi_tanh_warping
☆11 · Feb 19, 2021 · Updated 5 years ago
IU-SAIGE / pse
Efficient Personalized Speech Enhancement through Self-Supervised Learning
☆23 · Mar 12, 2023 · Updated 3 years ago
cadia-lvl / kaldi-speaker-diarization
This repository creates speaker diarization recipes to be used within the egs folder of Kaldi.
☆17 · Aug 12, 2024 · Updated last year
AIBigTruth / 0-9-speech-recognition-system-based-on-GMM
GMM-based isolated-word speech recognition system for the digits 0-9
☆10 · Sep 29, 2020 · Updated 5 years ago
dandugula / mod_ipp_g729
Hacked FreeSWITCH-G729 speech codec using Intel® Integrated Performance Primitives.
☆23 · Jan 22, 2013 · Updated 13 years ago
ahaliassos / raven
Official implementation of RAVEn (ICLR 2023) and BRAVEn (ICASSP 2024)
☆82 · Feb 27, 2025 · Updated last year
MrSyee / pokemon_story_generator
Yangjae AI practitioner training, Team 6 project
☆21 · Sep 18, 2018 · Updated 7 years ago
arda-num / SFSRNet
Reproduction of the paper SFSRNet: Super-resolution for single-channel Audio Source Separation by me (@arda-num) and @dritx16. Navigate P…
☆12 · Jul 7, 2022 · Updated 4 years ago
NibuTake / LiNGAM-fast
☆12 · May 17, 2018 · Updated 8 years ago
DinoMan / face-processor
Aligns faces to the canonical face in both videos and images
☆17 · Apr 11, 2022 · Updated 4 years ago
nc-ai / speech
☆17 · Aug 27, 2025 · Updated 10 months ago
aispeech-lab / WASE
PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Envi…
☆27 · Jan 11, 2022 · Updated 4 years ago
ZhangAustin / Deep-Speech
Deep Learning for Speech Recognition based on Theano
☆15 · Jul 28, 2017 · Updated 8 years ago
alexnorton / transcript-model
JSON schema and JavaScript model classes for dealing with time-aligned transcripts of speech.
☆16 · Aug 20, 2018 · Updated 7 years ago
ztjhz / miniLM
Small Model Is All You Need - NTU SC4001 Neural Network & Deep Learning Project
☆17 · Nov 9, 2023 · Updated 2 years ago
Mu-Y / DiariST
☆18 · Sep 19, 2023 · Updated 2 years ago
AndreevP / speech_distances
Deep Speech Distances PyTorch
☆29 · Feb 21, 2022 · Updated 4 years ago
dhpollack / programming_notebooks
A collection of programming notebooks that I've created.
☆16 · Oct 18, 2020 · Updated 5 years ago
asteroid-team / pytorch-pit
Permutation invariant training in PyTorch
☆13 · Oct 2, 2020 · Updated 5 years ago
nonday / awesome-voiceprint
A curated list of awesome Voiceprint Recognition papers
☆19 · Jul 9, 2021 · Updated 5 years ago
qiny1012 / kaldi_x-vector_aishell
Using the Kaldi x-vector method to train a speaker recognition model on the AISHELL database.
☆18 · Aug 19, 2021 · Updated 4 years ago
idiap / icassp-oov-recognition
Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"
☆17 · Nov 28, 2021 · Updated 4 years ago
jymsuper / VAD_tutorial
Simple DNN-based Voice Activity Detection (VAD) using PyTorch
☆43 · Feb 8, 2020 · Updated 6 years ago
jinsongpan / ASR_Course_Homework
Homework completed during the first session of the Shenlan Academy (深蓝学院) course "Speech Recognition: From Beginner to Mastery", shared for reference.
☆21 · Sep 13, 2020 · Updated 5 years ago
zouyajing / step-by-step-multi-sensor-fusion
☆13 · Nov 14, 2022 · Updated 3 years ago
neeek2303 / papers-to-read
Main articles I read or plan to read, as well as useful links.
☆12 · Feb 17, 2023 · Updated 3 years ago