A lightning fast audio upsampler.
☆776Feb 26, 2026Updated 3 months ago
Alternatives and similar repositories for NovaSR
Users that are interested in NovaSR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fast audio super resolution from 16khz to 48khz.☆212Jan 3, 2026Updated 5 months ago
- A pitch detection model trained to be robust against noise and reverberation environments.☆27Jan 21, 2025Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated last year
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- ☆52Sep 10, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"☆66Jun 3, 2023Updated 3 years ago
- ☆15Jul 23, 2024Updated last year
- ☆12Nov 7, 2024Updated last year
- Ablation study of local spectral attention (LSA) for full-band speech enhancement (SE)☆28Sep 16, 2023Updated 2 years ago
- A high quality and fast TTS repository☆512Dec 22, 2025Updated 5 months ago
- ☆25Jan 24, 2023Updated 3 years ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆37Aug 30, 2025Updated 9 months ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆104Apr 1, 2025Updated last year
- A Lightweight and Streaming Zero-Shot Voice Conversion via Mean Flows☆284Jan 8, 2026Updated 5 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆36Sep 6, 2025Updated 9 months ago
- The official implementation of GTCRN, an ultra-lightweight SE model.☆668Jan 18, 2026Updated 5 months ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆113Sep 2, 2025Updated 9 months ago
- Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"☆41Oct 20, 2025Updated 7 months ago
- Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.☆254Mar 7, 2025Updated last year
- Ultra-fast audio super resolution custom node for ComfyUI, powered by the NovaSR model.☆33Feb 12, 2026Updated 4 months ago
- LLaSE: Maximizing Acoustic Preservation for LLaMA based Speech Enhancement☆16Jul 11, 2025Updated 11 months ago
- Fully quantized Neural Networks for Audio Source Separation☆16Aug 11, 2024Updated last year
- Unofficial PyTorch implementation of "SCNet: Sparse Compression Network for Music Source Separation"☆63Apr 14, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Soprano-Factory: Train your own 2000x realtime text-to-speech model☆232Jan 13, 2026Updated 5 months ago
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW☆15Dec 10, 2024Updated last year
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 7 months ago
- Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios"(AAAI 2026)☆104Apr 21, 2026Updated last month
- Music repair method to convert lossy MP3 compressed music to lossless music.☆381Aug 12, 2025Updated 10 months ago
- Audio-FLAN☆161Sep 23, 2025Updated 8 months ago
- A toolkit for researchers in the multimodal sound separation.☆16Oct 20, 2023Updated 2 years ago
- Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative A…☆80Jun 16, 2025Updated last year
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆217Sep 19, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022)☆73May 11, 2024Updated 2 years ago
- ☆13Sep 12, 2024Updated last year
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆49Feb 17, 2026Updated 4 months ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆21May 20, 2025Updated last year
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- ☆71Dec 30, 2025Updated 5 months ago
- ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation☆118Dec 11, 2025Updated 6 months ago