The second generation of VoiceFixer, a toolkit for general speech restoration. *Not affiliated with the original VoiceFixer repo*
☆21Nov 19, 2023Updated 2 years ago
Alternatives and similar repositories for voicefixer2
Users that are interested in voicefixer2 are comparing it to the libraries listed below.
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆19Jun 6, 2025Updated 10 months ago
- This repo gives the code for the official implementation of RCT.☆14Jun 28, 2022Updated 3 years ago
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆21Jul 19, 2022Updated 3 years ago
- Keep track of good articles on speech processing, mainly on speech enhancement, including speech denoising, speech dereverberation, and AEC, ag…☆48Jul 17, 2024Updated last year
- ☆17Jul 22, 2024Updated last year
- Rebuild of GTCRN using Grouped TCNs, amidst other changes. Initially an attempt to target MCU deployment.☆23Jan 12, 2026Updated 3 months ago
- Unofficial implementation for the paper 'Improving Diffusion Models for Inverse Problems using Manifold Constraints'[https://arxiv.org/ab…☆12Aug 21, 2022Updated 3 years ago
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.☆15Aug 22, 2023Updated 2 years ago
- ☆10May 30, 2024Updated last year
- Noise suppression using deep filtering☆16May 31, 2022Updated 3 years ago
- This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…☆14Nov 25, 2022Updated 3 years ago
- ☆15Updated this week
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆45Feb 17, 2026Updated last month
- ☆23Jul 17, 2024Updated last year
- ☆17Mar 30, 2023Updated 3 years ago
- Fine-Tune Whisper with Transformers and PEFT☆58Nov 4, 2023Updated 2 years ago
- A lightweight SkinSmoothing Filter using Metal and CoreImage.☆14Jan 17, 2025Updated last year
- ☆16Dec 18, 2023Updated 2 years ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".☆29Sep 20, 2021Updated 4 years ago
- Containerized self-hosted REST API for vision classification, utilizing Hugging Face transformers.☆11Dec 5, 2024Updated last year
- ☆130Apr 24, 2023Updated 2 years ago
- ☆13Apr 22, 2024Updated last year
- Smart Encoder is a powerful tool for video encoding that optimizes video quality while minimizing file size. It automatically selects the…☆12Jun 13, 2025Updated 9 months ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆13Apr 15, 2025Updated 11 months ago
- A repo with the fonts I use for Shells and Programming, most of them with Ligatures and Powerline support!☆22Feb 25, 2026Updated last month
- An open source iOS framework☆14Jun 17, 2015Updated 10 years ago
- Auto-generated SWIG Python module with a binary component☆11Apr 19, 2012Updated 13 years ago
- ☆58Apr 24, 2024Updated last year
- A simple font manager for Mac. For activating and disabling fonts and installing Google Fonts☆13Mar 19, 2019Updated 7 years ago
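- LeetCode practice guide: a recommended order for 200 classic problems, detailed illustrated explanations totaling 600,000 characters, video walkthroughs of difficult points, and more than 50 mind maps, so that learning algorithms is no longer confusing! 🔥🔥 Take a look, and you will wish you had found it sooner! 🚀☆15Jul 12, 2021Updated 4 years ago
- Speech Recognition and Simple AI Summary: supports local speech-to-text, speaker diarization, and simple AI summarization, with a web-based interface.☆11Jul 22, 2024Updated last year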
- Flow control nodes for comfyUI, allowing for more diverse workflows☆13Apr 3, 2025Updated last year
- A PyTorch Implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers using gradio and Nvidia NEMO ASR mode…☆37Dec 28, 2023Updated 2 years ago
- The implementation of TaylorBeamformer, which is in submission to Interspeech2022☆48Jun 10, 2022Updated 3 years ago
- Modular FFmpeg builder☆13Sep 3, 2016Updated 9 years ago
- The purpose of this code base is to add noise from the MUSAN dataset at a specified signal-to-noise ratio to a clean speech signal and to generat…☆31Sep 21, 2021Updated 4 years ago
- Merge and clean up multi-line and multi-language subtitle files. Updated with language-based subtitle split. Merges multi-line English SRT subtitles into a single line while also merging the Chinese translation.…☆14Mar 30, 2015Updated 11 years ago
- Implementation of the paper: "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" in pytorch☆14Apr 6, 2026Updated last week
- A collection of tools for sleep research: plotting of hypnograms, spectrograms, etc.☆10Jan 24, 2026Updated 2 months ago