kamperh / vqwordseg
Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.
☆35Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for vqwordseg
- Transformer-based visually grounded speech models☆19Updated 2 years ago
- ☆31Updated last year
- ☆51Updated this week
- multilingual speech aligner☆72Updated last year
- DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning☆47Updated 10 months ago
- ☆27Updated last year
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆25Updated 11 months ago
- ☆36Updated 2 years ago
- ☆17Updated this week
- Alignment files of LibriTTS.☆60Updated 4 years ago
- Non-Autoregressive Predictive Coding☆50Updated 4 years ago
- CMU multilingual speech repository☆31Updated 2 years ago
- Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"☆61Updated 2 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆39Updated last year
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆35Updated last month
- A list of papers for child ASR☆26Updated last month
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆46Updated last year
- A CSRankings-like index for speech researchers☆31Updated last month
- ☆36Updated 3 years ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆61Updated 3 years ago
- Speech (audio) subjective evaluation system☆37Updated 4 years ago
- Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch.☆59Updated last year
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.☆44Updated 5 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆71Updated last year
- ☆15Updated 3 years ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆31Updated 4 months ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆69Updated 2 years ago
- Sylber: Syllabic Embedding Representation of Speech from Raw Audio☆20Updated last month