ysharma3501/LavaSR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ysharma3501/LavaSR)

ysharma3501 / LavaSR

🌋LavaSR: Fast Speech restoration and enhancement

☆566

Alternatives and similar repositories for LavaSR

Users that are interested in LavaSR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ysharma3501 / NovaSR
View on GitHub
A lightning fast audio upsampler.
☆775Feb 26, 2026Updated 5 months ago
faxlab / LavaSR-Fast-Enhancer
View on GitHub
☆21Mar 1, 2026Updated 4 months ago
NVlabs / LoRWeB
View on GitHub
We propose a novel modular framework that learns to dynamically mix low-rank adapters (LoRAs) to improve visual analogy learning, enablin…
☆75Jun 22, 2026Updated last month
ysharma3501 / FlashSR
View on GitHub
Fast audio super resolution from 16khz to 48khz.
☆215Jan 3, 2026Updated 6 months ago
xk-huang / VecGlypher
View on GitHub
[CVPR'26] VecGlypher: Unified Vector Glyph Generation with Language Models
☆136Feb 26, 2026Updated 5 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
liangbingzhao / PhysicEdit
View on GitHub
[ICML2026] From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors
☆92Apr 30, 2026Updated 2 months ago
ysharma3501 / LinaCodec
View on GitHub
A highly compressive and high-quality neural audio codec for speech models.
☆269Jan 23, 2026Updated 6 months ago
ysharma3501 / MiraTTS
View on GitHub
A high quality and fast TTS repository
☆517Dec 22, 2025Updated 7 months ago
woongzip1 / UniverSR
View on GitHub
Official implemtation of UniverSR (ICASSP 2026)
☆59Apr 9, 2026Updated 3 months ago
yxlu-0102 / AP-BWE
View on GitHub
Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction
☆194Apr 15, 2025Updated last year
ekwek1 / soprano-factory
View on GitHub
Soprano-Factory: Train your own 2000x realtime text-to-speech model
☆253Jan 13, 2026Updated 6 months ago
ysharma3501 / FastNeuTTS
View on GitHub
A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!
☆118Nov 24, 2025Updated 8 months ago
tue-mps / videomt
View on GitHub
[CVPR 2026] Official code and models for Video Encoder-only Mask Transformer (VidEoMT).
☆254Jun 23, 2026Updated last month
meituan-longcat / LongCat-AudioDiT
View on GitHub
☆555Apr 3, 2026Updated 3 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
ysharma3501 / LayaCodec
View on GitHub
High fidelity neural audio codec for TTS models
☆36Dec 22, 2025Updated 7 months ago
StellanLi / EchoFree
View on GitHub
☆18Feb 22, 2025Updated last year
nineninesix-ai / kani-tts
View on GitHub
☆461Nov 2, 2025Updated 8 months ago
smulelabs / windowed-roformer
View on GitHub
Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"
☆45Oct 30, 2025Updated 9 months ago
cisco-open / pase
View on GitHub
PASE: Phonologically Anchored Speech Enhancer
☆86Jul 15, 2026Updated 2 weeks ago
Guoxu1233 / DreamID-Omni
View on GitHub
[ICML 2026] DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation
☆275May 22, 2026Updated 2 months ago
snowflakewang / CustomX
View on GitHub
[ECCV 2026] CustomX: Unified Character, Action, and Scene Customization in Video World Models
☆96Jun 25, 2026Updated last month
ysy31415 / EffectMaker
View on GitHub
Code repo for EffectMaker: Unifying Reasoning and Generation for Customized Visual Effect Creation
☆42Mar 6, 2026Updated 4 months ago
hanjq17 / Spectrum
View on GitHub
[CVPR 2026] Adaptive Spectral Feature Forecasting for Diffusion Sampling Acceleration
☆126Apr 30, 2026Updated 2 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
FudanCVL / GlyphPrinter
View on GitHub
[CVPR 2026 Highlight] GlyphPrinter: Region-Grouped Direct Preference Optimization for Glyph-Accurate Visual Text Rendering
☆104Apr 9, 2026Updated 3 months ago
ekwek1 / soprano
View on GitHub
Soprano: Instant, Ultra-Realistic Text-to-Speech
☆1,382Jan 15, 2026Updated 6 months ago
Xiaobin-Rong / ul-unas
View on GitHub
The official repo of UL-UNAS, an ultra-lightweight SE model.
☆192Jun 17, 2026Updated last month
yukara-ikemiya / Open-Miipher-2
View on GitHub
PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind
☆70Sep 22, 2025Updated 10 months ago
OpenMOSS / MOSS-TTS
View on GitHub
MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fi…
☆3,923Updated this week
HumeAI / tada
View on GitHub
Open Source Speech Language Model
☆1,006May 11, 2026Updated 2 months ago
yl4579 / DMOSpeech2
View on GitHub
☆302Jul 22, 2025Updated last year
Francis-Rings / FlashPortrait
View on GitHub
[CVPR2026]We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length vide…
☆480Feb 21, 2026Updated 5 months ago
Vyvo-Labs / VyvoTTS
View on GitHub
VyvoTTS: LLM-Based Text-to-Speech Training Framework
☆257Apr 8, 2026Updated 3 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
mo230761 / UniGeo
View on GitHub
A framework for camera-controllable image editing using unified geometric guidance and video models.
☆65Jun 25, 2026Updated last month
yangdongchao / UniAudio2Demo
View on GitHub
☆26Feb 10, 2026Updated 5 months ago
KetsuiLabs / MichiAI
View on GitHub
MichiAI: A Low Latency, Full Duplex Speech LLM with zero coherence loss
☆109Apr 24, 2026Updated 3 months ago
Berkeley-Speech-Group / StyleStream
View on GitHub
☆61Jun 11, 2026Updated last month
justdubit / just-dub-it
View on GitHub
Code for 'JUST-DUB-IT: Video Dubbing via Joint Audio-Visual Diffusion'
☆268May 11, 2026Updated 2 months ago
ssi-research / FQSE
View on GitHub
Fully Quantized Neural Networks For Speech Enhancement
☆65Feb 15, 2024Updated 2 years ago
cwx-worst-one / WavTTS
View on GitHub
WavTTS: Towards High-Quality Zero-Shot TTS via Direct Raw Waveform Modeling
☆210Jun 6, 2026Updated last month