tan90xx/distillw2n

🤫 A Lightweight One-Shot Whisper to Normal Voice Conversion Model Using Distillation of Self-Supervised Features

☆25

Alternatives and similar repositories for distillw2n

Users who are interested in distillw2n are comparing it to the repositories listed below.

lucadellalib / discrete-wavlm-codec
A neural speech codec based on discrete WavLM representations
☆26 · Aug 28, 2024 · Updated last year
IiuZiKai / Evo_TSE
☆17 · Apr 9, 2026 · Updated 3 months ago
Xiaobin-Rong / unipase
Official repository of UniPASE, a SOTA USE model
☆51 · Jul 21, 2026 · Updated last week
rkmt / wesper-demo
☆36 · Dec 25, 2023 · Updated 2 years ago
Taltt / FNSE-SBGAN
FNSE-SBGAN: Far-field Speech Enhancement with Schrödinger Bridge and Generative Adversarial Networks
☆20 · May 12, 2025 · Updated last year
chaufanglin / Normal2Whisper
Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"
☆14 · Oct 31, 2024 · Updated last year
Clovermax / AED-TSVAD
Attention-Based Encoder-Decoder Target-Speaker Voice Activity Detection for Robust Speaker Diarization
☆31 · Sep 22, 2025 · Updated 10 months ago
ZLiNJU / AFC-SPEX
Source code and audio samples for AFC-SPEX, an algorithm that can jointly perform acoustic feedback cancellation and speaker extraction.
☆40 · Nov 7, 2025 · Updated 8 months ago
cisco-open / pase
PASE: Phonologically Anchored Speech Enhancer
☆86 · Jul 15, 2026 · Updated last week
Taltt / FNSE-SAT
☆46 · Jan 14, 2025 · Updated last year
Andong-Li-speech / Neural-Vocoders-as-Speech-Enhancers
☆52 · Sep 10, 2024 · Updated last year
Max1Wz / H-GTCRN
A Lightweight Hybrid Dual Channel Speech Enhancement System under Low-SNR Conditions (Interspeech 2025)
☆111 · Mar 13, 2026 · Updated 4 months ago
deepnetni / msa-dpcrn
☆21 · Jul 18, 2026 · Updated last week
Emrys365 / se-scaling
Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…
☆41 · Aug 7, 2024 · Updated last year
ASLP-lab / SenSE
Official code of SenSE.
☆90 · Oct 30, 2025 · Updated 8 months ago
echocatzh / MTFAA-Net
Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement
☆233 · Sep 30, 2022 · Updated 3 years ago
gitwukeyi / FSPEN
☆59 · Apr 24, 2024 · Updated 2 years ago
donghoney0416 / DeFTAN-II
Official page of "DeFTAN-II: Efficient multichannel speech enhancement with subgroup processing", IEEE/ACM Transactions on Audio, Speech,…
☆34 · Nov 21, 2024 · Updated last year
chentuochao / Target-Conversation-Extraction
This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic…
☆58 · Aug 15, 2025 · Updated 11 months ago
TuZehai / Sheffield_Clarity_CEC1_Entry
Implementation of Sheffield entry for Clarity enhancement challenge.
☆18 · Apr 19, 2022 · Updated 4 years ago
uthree / ddsp-vocoder
☆12 · Nov 7, 2024 · Updated last year
Mddct / usm-tokenizer
semantic tokenizer for speech and music
☆20 · Jul 6, 2025 · Updated last year
echocatzh / GTCNN
Personalized AEC
☆19 · Nov 3, 2022 · Updated 3 years ago
ZBang / USEF-TSE
☆70 · Jul 5, 2025 · Updated last year
Dahan-Wang / Rethinking-Flow-and-Diffusion-Bridge-Models-for-Speech-Enhancement
☆39 · Feb 23, 2026 · Updated 5 months ago
Yip-Jia-Qi / codecformer
☆21 · Jul 15, 2024 · Updated 2 years ago
bear-boy / DPCRN-Pytorch
Pytorch implementation of DPCRN
☆29 · Mar 31, 2024 · Updated 2 years ago
VoxBlink2 / ScriptsForVoxBlink2
Official Repository For VoxBlink2
☆88 · Aug 13, 2024 · Updated last year
ZhongYang2026 / Sandglasset-A-Light-Multi-Granularity-Self-Attentive-Network-For-Time-Domain-Speech-Separation
Speech Separation
☆21 · Mar 7, 2024 · Updated 2 years ago
fakufaku / auxiva-ipa
Fast algorithm for determined blind source separation with update of demixing filters with joint adjustment of the remaining sources.
☆36 · Mar 22, 2021 · Updated 5 years ago
hyyan2k / LiSenNet
This is the official implementation of the LiSenNet
☆163 · Nov 15, 2024 · Updated last year
ssi-research / FQSE
Fully Quantized Neural Networks For Speech Enhancement
☆65 · Feb 15, 2024 · Updated 2 years ago
primepake / F5-TTS-meanflow-multilingual
Meanflow and multilingual for F5-TTS model
☆16 · Aug 23, 2025 · Updated 11 months ago
HolgerBovbjerg / SSL-PVAD
A repository for code used to produce the results the ICASSP 2024 paper: "SELF-SUPERVISED PRETRAINING FOR ROBUST PERSONALIZED VOICE ACTIV…
☆25 · Nov 25, 2024 · Updated last year
Audio-WestlakeU / Mel-McNet
The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]
☆26 · May 14, 2026 · Updated 2 months ago
Xiaobin-Rong / deepvqe
An unofficial implementation of DeepVQE proposed by Microsoft Corp.
☆148 · Mar 24, 2025 · Updated last year
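GravityPoet / textream-zh
A smart teleprompter that sits hidden just below the Mac camera, designed for video recording, livestreaming, and meetings to help you keep natural eye contact. It supports Apple's built-in speech recognition and local AI large models, and automatically tracks and scrolls the script at your actual speaking pace, so you no longer have to worry about forgetting lines or scrolling by hand.
☆19 · Feb 24, 2026 · Updated 5 months ago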
sungwon23 / BSRNN
☆138 · Apr 24, 2023 · Updated 3 years ago
NikolaiKyhne / RWSAMamba-UNet
Official repository for the paper "Exploring Resolution-Wise Shared Attention in Hybrid Mamba-U-Nets for Improved Cross-Corpus Speech Enh…
☆19 · May 5, 2026 · Updated 2 months ago