JaesungHuh / SimpleDiarization
Simple Diarization model
☆47Updated last year
Alternatives and similar repositories for SimpleDiarization:
Users that are interested in SimpleDiarization are comparing it to the libraries listed below
- ☆46Updated this week
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆110Updated 2 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆82Updated last year
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆82Updated 2 months ago
- Speaker change detection using SincNet and an LSTM/Transformer☆48Updated 8 months ago
- ☆61Updated last year
- ☆80Updated 10 months ago
- This is the M-AILABS Speech Dataset☆47Updated 3 months ago
- SelfRemaster: SSL Speech Restoration☆88Updated last year
- ☆74Updated last year
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆50Updated last month
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆68Updated 6 months ago
- ☆54Updated last year
- Unofficial implementation of miipher☆120Updated 11 months ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆156Updated last year
- ☆39Updated last year
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆154Updated 2 weeks ago
- Predicts the level of noise and reverberation on your audiofiles☆146Updated 10 months ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆50Updated 7 months ago
- Various speech datasets made available to the public☆114Updated 3 months ago
- Clustering-based methods for overlapping diarization☆78Updated last year
- ☆62Updated 10 months ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆86Updated 4 months ago
- Implementation of SoundStorm built upon SpeechTokenizer.☆108Updated last year
- Reference-aware automatic speech evaluation toolkit☆144Updated 3 months ago
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆122Updated 3 weeks ago
- Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context☆192Updated 6 months ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆16Updated 4 months ago
- ☆64Updated 6 months ago
- multilingual speech aligner☆72Updated last year