ysharma3501 / NovaSRView external linksLinks
A lightning fast audio upsampler.
☆710Feb 2, 2026Updated 2 weeks ago
Alternatives and similar repositories for NovaSR
Users that are interested in NovaSR are comparing it to the libraries listed below
Sorting:
- A pitch detection model trained to be robust against noise and reverberation environments.☆27Jan 21, 2025Updated last year
- Fast audio super resolution from 16khz to 48khz.☆192Jan 3, 2026Updated last month
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- ☆11Nov 7, 2024Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- LLaSE: Maximizing Acoustic Preservation for LLaMA based Speech Enhancement☆16Jul 11, 2025Updated 7 months ago
- Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.☆245Mar 7, 2025Updated 11 months ago
- ☆36Sep 6, 2025Updated 5 months ago
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- Reimplementation of Miipher☆29Aug 16, 2023Updated 2 years ago
- Soprano-Factory: Train your own 2000x realtime text-to-speech model☆206Jan 13, 2026Updated last month
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆31Aug 30, 2025Updated 5 months ago
- ☆52Sep 10, 2024Updated last year
- ☆25Jan 24, 2023Updated 3 years ago
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Jan 2, 2024Updated 2 years ago
- ☆54Jul 16, 2025Updated 7 months ago
- A high quality and fast TTS repository☆502Dec 22, 2025Updated last month
- Text-To-Speech for NotebookLM☆37Jul 20, 2025Updated 6 months ago
- Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios"(AAAI 2026)☆89Jan 31, 2026Updated 2 weeks ago
- ☆68Dec 30, 2025Updated last month
- ☆13Sep 12, 2024Updated last year
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- Trainging, inference, and testing of the SAC speech codec model.☆96Nov 1, 2025Updated 3 months ago
- Audio-FLAN☆160Sep 23, 2025Updated 4 months ago
- The official implementation of GTCRN, an ultra-lightweight SE model.☆561Jan 18, 2026Updated 3 weeks ago
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆212Sep 19, 2024Updated last year
- Code for the blog "Neural audio codecs: how to get audio into LLMs"☆151Oct 20, 2025Updated 3 months ago
- Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"☆66Jun 3, 2023Updated 2 years ago
- ☆66Aug 16, 2023Updated 2 years ago
- Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"☆42Oct 30, 2025Updated 3 months ago
- ☆32May 17, 2024Updated last year
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 3 months ago
- A Python Library for Fundamental Frequency Estimation in Music Recordings☆54Jan 16, 2026Updated last month
- My vocoder experiments☆31Jul 26, 2025Updated 6 months ago
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆97Apr 1, 2025Updated 10 months ago
- An example of a speech enhancement model deployed with TensorRT.☆77Mar 24, 2025Updated 10 months ago
- ☆19Mar 22, 2024Updated last year
- Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"☆41Oct 20, 2025Updated 3 months ago