The second generation of VoiceFixer, a toolkit for general speech restoration. *Not affiliated with the original VoiceFixer repo*
☆21Nov 19, 2023Updated 2 years ago
Alternatives and similar repositories for voicefixer2
Users that are interested in voicefixer2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆19Jun 6, 2025Updated 9 months ago
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆21Jul 19, 2022Updated 3 years ago
- Keep track of good articles on speech processing, mainly on speech enhancement, include speech denoise, speech dereverberation and aec、ag…☆47Jul 17, 2024Updated last year
- Neural Speech Codec☆25Jan 25, 2021Updated 5 years ago
- ☆17Jul 22, 2024Updated last year
- ☆17May 5, 2024Updated last year
- silero-vad pytorch implement☆36Nov 23, 2024Updated last year
- Unofficial implementation for the paper 'Improving Diffusion Models for Inverse Problems using Manifold Constraints'[https://arxiv.org/ab…☆12Aug 21, 2022Updated 3 years ago
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.☆15Aug 22, 2023Updated 2 years ago
- ☆10May 30, 2024Updated last year
- A lightweight SkinSmoothing Filter using Metal and CoreImage.☆13Jan 17, 2025Updated last year
- Noise supression using deep filtering☆15May 31, 2022Updated 3 years ago
- Peach - the porn organizer☆12Jun 10, 2024Updated last year
- This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…☆14Nov 25, 2022Updated 3 years ago
- Containerized self-hosted REST API for vision classification, utilizing Hugging Face transformers.☆10Dec 5, 2024Updated last year
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆45Feb 17, 2026Updated last month
- ☆23Jul 17, 2024Updated last year
- ☆17Mar 30, 2023Updated 2 years ago
- Fine-Tune Whisper with Transformers and PEFT☆58Nov 4, 2023Updated 2 years ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".☆29Sep 20, 2021Updated 4 years ago
- ☆129Apr 24, 2023Updated 2 years ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆12Apr 15, 2025Updated 11 months ago
- Smart Encoder is a powerful tool for video encoding that optimizes video quality while minimizing file size. It automatically selects the…☆12Jun 13, 2025Updated 9 months ago
- My preferred set of Image Quality Enhancing Shaders and other scripts for MPV.☆11Jan 3, 2022Updated 4 years ago
- LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀☆15Jul 12, 2021Updated 4 years ago
- ☆58Apr 24, 2024Updated last year
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- A simple font manager for Mac. For activating and disabling fonts and installing Google Fonts☆13Mar 19, 2019Updated 7 years ago
- Speech Recognition and Simple AI Summary:可用于本地语音转文字、说话人分割及简易的AI总结,搭配web端操作界面。☆11Jul 22, 2024Updated last year
- Flow control nodes for comfyUI, allowing for more diverse workflows☆13Apr 3, 2025Updated 11 months ago
- A PyTorch Implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers using gradio and Nvidia NEMO ASR mode…☆37Dec 28, 2023Updated 2 years ago
- The implementation of TaylorBeamformer, which is in submission to Interspeech2022☆48Jun 10, 2022Updated 3 years ago
- The purpose of this code base is to add a specified signal-to-noise ratio noise from MUSAN dataset to a pure speech signal and to generat…☆31Sep 21, 2021Updated 4 years ago
- Implementation of the paper: "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" in pytorch☆14Mar 16, 2026Updated last week
- Merge and clean up multi-line and multi-language subtitle files. Updated with language-based subtitle split. 将带有多行英文的SRT字幕合并成单行,同时合并中文翻译。…☆14Mar 30, 2015Updated 10 years ago
- Unofficial PyTorch implementation of "SCNet: Sparse Compression Network for Music Source Separation"☆61Apr 14, 2024Updated last year
- A collections of tools around sleep research: plotting of hypnograms / spectrograms, etc etc☆10Jan 24, 2026Updated last month
- Sound Separation, Omni modal☆28Sep 15, 2025Updated 6 months ago
- Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation☆727Feb 1, 2026Updated last month