apple / ml-acn-embed
Acoustic Neighbor Embeddings
☆28 Updated 2 months ago
Alternatives and similar repositories for ml-acn-embed
Users that are interested in ml-acn-embed are comparing it to the libraries listed below
- Collection of scripts from mHuBERT-147. ☆30 Updated 10 months ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 Updated last year
- Supervoice diffusion enhance ☆27 Updated last year
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition. ☆15 Updated 4 months ago
- ☆14 Updated last year
- ☆25 Updated last year
- PyTorch implementation of Miipher-2 [2025], a speech restoration model by Google DeepMind ☆52 Updated 2 weeks ago
- ☆43 Updated 2 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 Updated 2 years ago
- My vocoder experiments ☆31 Updated 2 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment ☆16 Updated 3 years ago
- A neural vocoder supporting Ring Attention, Conformer and NSF. ☆21 Updated 2 months ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation ☆26 Updated 9 months ago
- A package for NeuCodec: a 50 Hz, 0.8 kbps, 24 kHz audio codec. ☆67 Updated 2 weeks ago
- GPT-style network for phonemization with durations of text ☆67 Updated last year
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark ☆30 Updated 5 months ago
- Temporary anonymous version ☆22 Updated last year
- Export an ONNX graph that performs ISTFT. Designed for TTS models. ☆27 Updated last year
- A collection of all our phonemizers for dataset construction and inference ☆26 Updated 7 months ago
- Audio tokenization, in the fastest way possible! ☆53 Updated last year
- Codebase and project page for EDMSound ☆34 Updated last year
- ☆60 Updated last year
- The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral) ☆47 Updated last month
- StyleTTS 2 Optimized Training Fork ☆33 Updated 8 months ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices" ☆20 Updated 4 months ago
- GPT for FACodec ☆13 Updated last year
- ☆15 Updated last year
- Forced alignment decoder for Whisper. ☆14 Updated last year
- ☆57 Updated last year
- ☆22 Updated 2 years ago