🌋LavaSR: Fast Speech restoration and enhancement
☆482Mar 10, 2026Updated 2 weeks ago
Alternatives and similar repositories for LavaSR
Users that are interested in LavaSR are comparing it to the libraries listed below.
- [AAAI 2026] UltraGen☆77Feb 1, 2026Updated last month
- [CVPR 2026🔥] Enhancing Spatial Understanding in Image Generation via Reward Modeling☆79Mar 2, 2026Updated 3 weeks ago
- Animate Any Character in Any World☆97Mar 10, 2026Updated 2 weeks ago
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars"☆41Updated this week
- Training and inference code for a reproduction of Google's speech restoration model Miipher-2. Pretrained models are also released.☆31Feb 7, 2026Updated last month
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…☆76Jan 25, 2026Updated 2 months ago
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆30Sep 20, 2025Updated 6 months ago
- D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI [ICLR 2026]☆77Mar 3, 2026Updated 3 weeks ago
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆47Sep 19, 2025Updated 6 months ago
- ☆76Mar 16, 2026Updated 2 weeks ago
- ☆111Nov 6, 2025Updated 4 months ago
- Official implementation of AnchorWeave: World-Consistent Video Generation with Retrieved Local Spatial Memories☆88Feb 17, 2026Updated last month
- [CVPR 2026] SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time☆103Updated this week
- Fully Quantized Neural Networks For Speech Enhancement☆63Feb 15, 2024Updated 2 years ago
- Scaling Zero-Shot Reference-to-Video Generation☆66Dec 11, 2025Updated 3 months ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆110May 20, 2025Updated 10 months ago
- https://little-misfit.github.io/GRAG-Image-Editing/☆117Nov 27, 2025Updated 4 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆26Mar 28, 2025Updated last year
- ☆12Jun 17, 2019Updated 6 years ago
- A highly compressive and high-quality neural audio codec for speech models.☆261Jan 23, 2026Updated 2 months ago
- ☆51Mar 5, 2026Updated 3 weeks ago
- Official Codebase for our CVPR 2026 paper UniSH: Unifying Scene and Human Reconstruction in a Feed-Forward Pass☆137Feb 24, 2026Updated last month
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement☆25Apr 16, 2023Updated 2 years ago
- ☆117Updated this week
- [CVPR 2026] Official implementation of the paper "HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Pre…☆63Mar 3, 2026Updated 3 weeks ago
- PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind☆64Sep 22, 2025Updated 6 months ago
- SegviGen: Repurposing 3D Generative Model for Part Segmentation☆104Mar 19, 2026Updated last week
- The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi…☆64Dec 26, 2025Updated 3 months ago
- A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis☆24Feb 11, 2026Updated last month
- Track4World: Feedforward World-centric Dense 3D Tracking of All Pixels☆193Mar 11, 2026Updated 2 weeks ago
- ☆54Mar 2, 2023Updated 3 years ago
- A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.☆47Mar 20, 2026Updated last week
- Official Repository of Paper: "Emilia-NV: A Non-Verbal Speech Dataset with Word-Level Annotation for Human-Like Speech Modeling"☆87Sep 18, 2025Updated 6 months ago
- Evaluation tool used in the BigVSAN paper☆14Mar 22, 2024Updated 2 years ago
- ☆82Mar 7, 2026Updated 3 weeks ago
- Latency Comparison among Serverless Databases: DynamoDB, FaunaDB, MongoDB, Cassandra, Firestore and Upstash☆26Sep 20, 2021Updated 4 years ago
- Eureka-Audio: A 1.7B lightweight audio–language model that matches 7B–30B models on ASR, audio understanding, and paralinguistic reasonin…☆35Feb 28, 2026Updated last month
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year