A lightning fast audio upsampler.
☆737Feb 26, 2026Updated last week
Alternatives and similar repositories for NovaSR
Users that are interested in NovaSR are comparing it to the libraries listed below
Sorting:
- A pitch detection model trained to be robust against noise and reverberation environments.☆27Jan 21, 2025Updated last year
- Fast audio super resolution from 16khz to 48khz.☆199Jan 3, 2026Updated 2 months ago
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- ☆11Nov 7, 2024Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- ☆52Sep 10, 2024Updated last year
- LLaSE: Maximizing Acoustic Preservation for LLaMA based Speech Enhancement☆16Jul 11, 2025Updated 7 months ago
- Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.☆246Mar 7, 2025Updated last year
- Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"☆66Jun 3, 2023Updated 2 years ago
- ☆36Sep 6, 2025Updated 6 months ago
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"☆41Oct 20, 2025Updated 4 months ago
- Reimplementation of Miipher☆29Aug 16, 2023Updated 2 years ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆31Aug 30, 2025Updated 6 months ago
- ☆25Jan 24, 2023Updated 3 years ago
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Jan 2, 2024Updated 2 years ago
- ☆54Jul 16, 2025Updated 7 months ago
- Soprano-Factory: Train your own 2000x realtime text-to-speech model☆211Jan 13, 2026Updated last month
- A high quality and fast TTS repository☆505Dec 22, 2025Updated 2 months ago
- An example of a speech enhancement model deployed with TensorRT.☆78Mar 24, 2025Updated 11 months ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆100Apr 1, 2025Updated 11 months ago
- Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios"(AAAI 2026)☆89Jan 31, 2026Updated last month
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 7 months ago
- ☆68Dec 30, 2025Updated 2 months ago
- ☆13Sep 12, 2024Updated last year
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.☆15Aug 22, 2023Updated 2 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- Trainging, inference, and testing of the SAC speech codec model.☆100Nov 1, 2025Updated 4 months ago
- Audio-FLAN☆159Sep 23, 2025Updated 5 months ago
- Ablation study of local spectral attention (LSA) for full-band speech enhancement (SE)☆28Sep 16, 2023Updated 2 years ago
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆212Sep 19, 2024Updated last year
- The official implementation of GTCRN, an ultra-lightweight SE model.☆575Jan 18, 2026Updated last month
- ☆67Aug 16, 2023Updated 2 years ago
- Code for the blog "Neural audio codecs: how to get audio into LLMs"☆156Oct 20, 2025Updated 4 months ago
- Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"☆43Oct 30, 2025Updated 4 months ago
- ☆32May 17, 2024Updated last year
- ☆13Jul 23, 2024Updated last year
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 4 months ago
- A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis☆23Feb 11, 2026Updated 3 weeks ago