🌋LavaSR: Fast Speech restoration and enhancement
☆390Mar 2, 2026Updated this week
Alternatives and similar repositories for LavaSR
Users that are interested in LavaSR are comparing it to the libraries listed below
Sorting:
- [AAAI 2026] UltraGen☆77Feb 1, 2026Updated last month
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆28Sep 20, 2025Updated 5 months ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…☆76Jan 25, 2026Updated last month
- ☆49Apr 1, 2025Updated 11 months ago
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆38Feb 24, 2025Updated last year
- Googleの音声復元モデルMiipher-2の再現実装の学習および推論コード。学習済みモデルも公開しています。☆31Feb 7, 2026Updated last month
- ☆54Mar 2, 2023Updated 3 years ago
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement☆25Apr 16, 2023Updated 2 years ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆110May 20, 2025Updated 9 months ago
- Sequence alignement methods with helpers for PyTorch.☆24Nov 30, 2022Updated 3 years ago
- Fully Quantized Neural Networks For Speech Enhancement☆63Feb 15, 2024Updated 2 years ago
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars"☆41Nov 20, 2025Updated 3 months ago
- Animate Any Character in Any World☆90Jan 9, 2026Updated last month
- Official code of SenSE.☆74Oct 30, 2025Updated 4 months ago
- D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI [ICLR 2026]☆72Jan 15, 2026Updated last month
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆69Nov 1, 2024Updated last year
- A Flow Matching-based Text-to-Speech Model with Emoji-driven Style Control☆28Feb 27, 2026Updated last week
- ☆12Jun 17, 2019Updated 6 years ago
- Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support"☆53Jul 29, 2025Updated 7 months ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- ☆28Sep 5, 2024Updated last year
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Jan 2, 2024Updated 2 years ago
- ☆54Jul 16, 2025Updated 7 months ago
- Resilient multi-LLM orchestration with in-built failure handing, rate limits, retries, and circuit breaker.☆29Updated this week
- Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative model…☆82Feb 3, 2026Updated last month
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆15Aug 1, 2024Updated last year
- A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis☆23Feb 11, 2026Updated 3 weeks ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Sep 29, 2025Updated 5 months ago
- ☆117Feb 26, 2026Updated last week
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆27Sep 12, 2024Updated last year
- Streaming Vocos☆30Jun 10, 2025Updated 8 months ago
- https://little-misfit.github.io/GRAG-Image-Editing/☆115Nov 27, 2025Updated 3 months ago
- Official Repository of Paper: "Emilia-NV: A Non-Verbal Speech Dataset with Word-Level Annotation for Human-Like Speech Modeling"☆84Sep 18, 2025Updated 5 months ago
- TG-CRITIC: A TIMBRE-GUIDED MODEL FOR REFERENCE-INDEPENDENT SINGING EVALUATION☆15May 26, 2023Updated 2 years ago
- ☆110Nov 6, 2025Updated 4 months ago
- Repository of published DNN speech separation recipes for a number of datasets☆12Jan 22, 2024Updated 2 years ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆13Jul 22, 2024Updated last year
- ip forwarding into your tailnet. An entrypoint proxy into your tailnet if you will☆26Dec 4, 2025Updated 3 months ago