FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings.
☆26Apr 1, 2026Updated last month
Alternatives and similar repositories for FINALLY-Speech-Enhancement
Users that are interested in FINALLY-Speech-Enhancement are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated 2 years ago
- ☆10Apr 8, 2024Updated 2 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- (Interspeech 2025, official code) Speech enhancement based on cascaded two flows☆16Sep 1, 2025Updated 8 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆12Jun 11, 2024Updated last year
- [CVPR 2025] Official implementation of paper "Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie…☆23Jun 6, 2025Updated 11 months ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆122Jul 14, 2022Updated 3 years ago
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆113Aug 1, 2025Updated 9 months ago
- Sing any popular song with your voice☆11Jul 10, 2022Updated 3 years ago
- Based on https://github.com/fatchord/WaveRNN☆24May 3, 2020Updated 6 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- PASE: Phonologically Anchored Speech Enhancer☆60Apr 9, 2026Updated last month
- This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"☆16Jun 27, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching☆125Apr 8, 2026Updated last month
- ☆18Feb 9, 2020Updated 6 years ago
- Manipulating semantic data within Python☆20Jan 14, 2025Updated last year
- Parallel waveform generation with DiffusionGAN☆17Mar 26, 2022Updated 4 years ago
- ☆16Apr 4, 2022Updated 4 years ago
- 1D version of Pytorch's PixelShuffle module☆24Jul 25, 2019Updated 6 years ago
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆21Oct 8, 2024Updated last year
- An official documentation of the paper <Wave-U-Mamba: An End-To-End Framework For High-Quality And Efficient Speech Super Resolution>.☆25Oct 29, 2025Updated 7 months ago
- Official PyTorch implementation of TTS Style Transfer☆25Jun 22, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme…☆26Aug 14, 2025Updated 9 months ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆25Dec 12, 2024Updated last year
- Spherical residual vector quantization (SRVQ)☆31Aug 25, 2024Updated last year
- multilingual speech aligner☆77Nov 19, 2023Updated 2 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆43May 9, 2023Updated 3 years ago
- ☆11Oct 8, 2022Updated 3 years ago
- LTFT-Phase-Vocoder is an audio effect that slows down an audio signal without dilating its frequency content or pitch.☆16Dec 19, 2020Updated 5 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Jun 18, 2022Updated 3 years ago
- Speech Human Evaluation Estimation Toolkit (SHEET)☆134Mar 31, 2026Updated last month
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- pre-process script for timit data for dnn-aec works☆38Mar 3, 2022Updated 4 years ago
- Official repo for CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations☆66Jan 16, 2025Updated last year
- Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)☆154Feb 1, 2023Updated 3 years ago
- We are very happy that our work has been accepted by ACM Multimedia 2024!🥰☆11Jan 8, 2025Updated last year
- SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering☆178May 2, 2026Updated 3 weeks ago
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 3 years ago
- [CVPR 2025] Official implementation of paper "Multi-Granularity Class Prototype Topology Distillation for Class-Incremental Source-Free …☆18Apr 16, 2026Updated last month