danpovey / quantization
Torch-based tool for quantizing high-dimensional vectors using additive codebooks
☆50 · Updated 2 years ago
Related projects:
- HMM, CTC, RNN-Transducer, forward-backward algorithm ☆19 · Updated last year
- ☆35 · Updated 2 years ago
- Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch. ☆56 · Updated last year
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z… ☆30 · Updated 2 years ago
- video cut powered by AI ☆25 · Updated last year
- ☆41 · Updated 10 months ago
- Implementation of CTC alignment-based single step non-autoregressive transformer ☆11 · Updated last year
- A collection of papers related to speech model compression ☆24 · Updated last year
- ☆54 · Updated 3 years ago
- End-to-end diarization loss ☆19 · Updated 3 years ago
- multilingual speech aligner ☆70 · Updated 10 months ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT ☆68 · Updated last year
- RepVgg + HiFiGAN ☆33 · Updated 2 years ago
- CMU multilingual speech repository ☆31 · Updated 2 years ago
- ☆23 · Updated last month
- A CSRankings-like index for speech researchers ☆30 · Updated last year
- PPSpeech: Phrase based Parallel End-to-End TTS System ☆35 · Updated 4 years ago
- Memory efficient transducer loss computation ☆68 · Updated 2 years ago
- Repo for the FB AI Speech team. ☆22 · Updated 3 years ago
- streaming attention networks for end-to-end automatic speech recognition ☆55 · Updated 4 years ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation ☆69 · Updated 3 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files ☆47 · Updated 2 months ago
- Small compression utility ☆33 · Updated 2 months ago
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs ☆33 · Updated last week
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network ☆45 · Updated 2 years ago
- ☆69 · Updated this week
- Multipurpose Multi Speaker Mixture Signal Generator ☆43 · Updated 6 months ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem". ☆12 · Updated 2 years ago
- ☆52 · Updated 3 years ago
- Streaming Audiotransformers for online Audio tagging ☆39 · Updated 3 months ago