🌋LavaSR: Fast Speech restoration and enhancement
☆508Apr 6, 2026Updated last week
Alternatives and similar repositories for LavaSR
Users that are interested in LavaSR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [AAAI 2026] UltraGen☆78Feb 1, 2026Updated 2 months ago
- Onset-and-Offset-Aware Sound Event Detection☆22Feb 10, 2025Updated last year
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars"☆42Mar 24, 2026Updated 3 weeks ago
- Animate Any Character in Any World☆97Mar 10, 2026Updated last month
- [CVPR 2026🔥] Enhancing Spatial Understanding in Image Generation via Reward Modeling☆80Mar 2, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Googleの音声復元モデルMiipher-2の再現実装の学習および推論コード。学習済みモデルも公開しています。☆31Feb 7, 2026Updated 2 months ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆110May 20, 2025Updated 10 months ago
- This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…☆77Jan 25, 2026Updated 2 months ago
- ☆17Feb 14, 2026Updated 2 months ago
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆30Sep 20, 2025Updated 7 months ago
- D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI [ICLR 2026]☆77Mar 3, 2026Updated last month
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆47Sep 19, 2025Updated 7 months ago
- ☆84Mar 16, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆115Nov 6, 2025Updated 5 months ago
- Official implementation of AnchorWeave: World-Consistent Video Generation with Retrieved Local Spatial Memories☆90Feb 17, 2026Updated 2 months ago
- [CVPR 2026] SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time☆106Updated this week
- Fully Quantized Neural Networks For Speech Enhancement☆63Feb 15, 2024Updated 2 years ago
- poorman's ar-dit tts☆45Dec 31, 2025Updated 3 months ago
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆50Feb 17, 2026Updated 2 months ago
- [CVPR 2026] Scaling Zero-Shot Reference-to-Video Generation☆69Dec 11, 2025Updated 4 months ago
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis☆39Dec 24, 2025Updated 3 months ago
- https://little-misfit.github.io/GRAG-Image-Editing/☆117Nov 27, 2025Updated 4 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆26Mar 28, 2025Updated last year
- ☆12Jun 17, 2019Updated 6 years ago
- A highly compressive and high-quality neural audio codec for speech models.☆262Jan 23, 2026Updated 2 months ago
- ☆50Mar 5, 2026Updated last month
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement☆25Apr 16, 2023Updated 3 years ago
- Gen-Searcher: Reinforcing Agentic Search for Image Generation☆280Apr 7, 2026Updated last week
- ☆117Mar 24, 2026Updated 3 weeks ago
- Official Codebase for our CVPR 2026 paper UniSH: Unifying Scene and Human Reconstruction in a Feed-Forward Pass☆140Feb 24, 2026Updated last month
- [CVPR 2026] Offical implementation of the paper "HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Pre…☆79Mar 3, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind☆64Sep 22, 2025Updated 6 months ago
- The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi…☆65Dec 26, 2025Updated 3 months ago
- A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis☆24Feb 11, 2026Updated 2 months ago
- Fully quantized Neural Networks for Audio Source Separation☆16Aug 11, 2024Updated last year
- ☆55Mar 2, 2023Updated 3 years ago
- [Official Repo] SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing☆97Apr 7, 2026Updated last week
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆36Aug 30, 2025Updated 7 months ago