🌋LavaSR: Fast Speech restoration and enhancement
☆537Apr 6, 2026Updated last month
Alternatives and similar repositories for LavaSR
Users that are interested in LavaSR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [AAAI 2026] UltraGen☆78Feb 1, 2026Updated 3 months ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars"☆43Mar 24, 2026Updated 2 months ago
- [CVPR 2026🔥] Enhancing Spatial Understanding in Image Generation via Reward Modeling☆84Mar 2, 2026Updated 2 months ago
- [arXiv 2512.17796] Animate Any Character in Any World☆96Mar 10, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Googleの音声復元モデルMiipher-2の再現実装の学習および推論コード。学習済みモデルも公開しています。☆31Feb 7, 2026Updated 3 months ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆109May 20, 2025Updated last year
- This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…☆77Jan 25, 2026Updated 4 months ago
- ☆21Feb 14, 2026Updated 3 months ago
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆31Sep 20, 2025Updated 8 months ago
- D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI [ICLR 2026]☆82Mar 3, 2026Updated 2 months ago
- UniVid: The Open-Source Unified Video Model☆32Oct 13, 2025Updated 7 months ago
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆47Sep 19, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆88Mar 16, 2026Updated 2 months ago
- ☆118Nov 6, 2025Updated 6 months ago
- Official implementation of AnchorWeave: World-Consistent Video Generation with Retrieved Local Spatial Memories☆91Feb 17, 2026Updated 3 months ago
- ☆10Jun 11, 2024Updated last year
- Fully Quantized Neural Networks For Speech Enhancement☆63Feb 15, 2024Updated 2 years ago
- poorman's ar-dit tts☆45Dec 31, 2025Updated 4 months ago
- [CVPR 2026] SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time☆115May 17, 2026Updated last week
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆49Feb 17, 2026Updated 3 months ago
- [CVPR 2026] Scaling Zero-Shot Reference-to-Video Generation☆72Apr 28, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis☆39Dec 24, 2025Updated 5 months ago
- https://little-misfit.github.io/GRAG-Image-Editing/☆119Nov 27, 2025Updated 6 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆26Mar 28, 2025Updated last year
- ☆12Jun 17, 2019Updated 6 years ago
- A highly compressive and high-quality neural audio codec for speech models.☆266Jan 23, 2026Updated 4 months ago
- ☆51Mar 5, 2026Updated 2 months ago
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement☆25Apr 16, 2023Updated 3 years ago
- ☆120May 5, 2026Updated 3 weeks ago
- [CVPR 2026] Offical implementation of the paper "HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Pre…☆88May 11, 2026Updated 2 weeks ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind☆65Sep 22, 2025Updated 8 months ago
- The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi…☆66Dec 26, 2025Updated 5 months ago
- Official Codebase for our CVPR 2026 paper UniSH: Unifying Scene and Human Reconstruction in a Feed-Forward Pass☆145Feb 24, 2026Updated 3 months ago
- Fully quantized Neural Networks for Audio Source Separation☆16Aug 11, 2024Updated last year
- A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis☆27Feb 11, 2026Updated 3 months ago
- ☆55Mar 2, 2023Updated 3 years ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆36Aug 30, 2025Updated 9 months ago