rkmt/wesper-demo

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/rkmt/wesper-demo)

rkmt / wesper-demo

☆36

Alternatives and similar repositories for wesper-demo

Users that are interested in wesper-demo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tan90xx / distillw2n
View on GitHub
🤫A Lightweight One-Shot Whisper to Normal Voice Conversion Model Using Distillation of Self-Supervised Features
☆25Dec 10, 2025Updated 7 months ago
chaufanglin / Normal2Whisper
View on GitHub
Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"
☆14Oct 31, 2024Updated last year
dwgnr / speech-conversion
View on GitHub
Whisper to Normal Speech Conversion with SC-MelGAN and SC-VQ-VAE
☆15Dec 3, 2022Updated 3 years ago
Taltt / FNSE-SBGAN
View on GitHub
FNSE-SBGAN: Far-field Speech Enhancement with Schrödinger Bridge and Generative Adversarial Networks
☆20May 12, 2025Updated last year
Honee-W / CPTNN
View on GitHub
unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"
☆15Nov 14, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Many0therFunctions / MaskGCT-Text-To-Semantic-Finetune
View on GitHub
This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …
☆13Dec 4, 2024Updated last year
zeta-chicken / toWhisper
View on GitHub
☆29Jul 12, 2024Updated 2 years ago
lucadellalib / discrete-wavlm-codec
View on GitHub
A neural speech codec based on discrete WavLM representations
☆26Aug 28, 2024Updated last year
Maitreyapatel / speech-conversion-between-different-modalities
View on GitHub
Generative Adversarial Networks for different impaired speech conversions
☆39Jul 6, 2023Updated 3 years ago
audiodemo / voice-conversion
View on GitHub
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Aug 18, 2023Updated 2 years ago
iiscleap / ZEST
View on GitHub
Zero-Shot Emotion Style Transfer
☆49Apr 23, 2025Updated last year
uthree / ddsp-vocoder
View on GitHub
☆12Nov 7, 2024Updated last year
fchest / Speech-Transformer-multi-GPUs
View on GitHub
A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code…
☆10Dec 25, 2019Updated 6 years ago
zcf28 / StyleGAN-VC
View on GitHub
Voice Conversion method based on speaker style
☆14Aug 7, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
zzy1hjq / NeuralVC
View on GitHub
A real-time voice conversion model based on VITS.
☆16Aug 1, 2024Updated last year
WangHelin1997 / DuTa-VC
View on GitHub
Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…
☆38Dec 5, 2023Updated 2 years ago
hilab-open-source / WatchYourMouth
View on GitHub
Implementation for WatchYourMouth: Silent Speech Recognition with Depth Sensing presented at CHI 2024
☆23Oct 6, 2025Updated 9 months ago
tuanio / nextformer
View on GitHub
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
☆10Dec 15, 2022Updated 3 years ago
wonjune-kang / lvc-vc
View on GitHub
End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions
☆94Nov 6, 2023Updated 2 years ago
sp-uhh / mp-gtf
View on GitHub
Multi-Phase Gammatone Filterbank (MP-GTF) construction for Python
☆48Apr 30, 2020Updated 6 years ago
ZhaoF-i / SDAEC
View on GitHub
☆19Jan 6, 2025Updated last year
tonnetonne814 / QuickVC-44100-Ja_HuBERT
View on GitHub
44100Hz日本語HuBERTに対応した QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion です。
☆16May 21, 2023Updated 3 years ago
ChangLee0903 / SERIL
View on GitHub
Official Implementation of SERIL in Pytorch
☆27Sep 29, 2020Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
rrbluke / BSSD
View on GitHub
Blind Source Separation and Dereverberation
☆21Mar 26, 2021Updated 5 years ago
yuguochencuc / CinCGAN-SE
View on GitHub
Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement
☆10Jan 24, 2022Updated 4 years ago
lars76 / fastspeech2-clean
View on GitHub
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
☆18Aug 16, 2024Updated last year
7Xin / DPI-TTS
View on GitHub
☆13Sep 12, 2024Updated last year
YUCHEN005 / UNA-GAN
View on GitHub
Code for paper "Unsupervised Noise adaptation using Data Simulation"
☆14May 16, 2024Updated 2 years ago
reppy4620 / convnext_tts
View on GitHub
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18Oct 20, 2024Updated last year
WingZLeung / TTDS
View on GitHub
Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.
☆13Mar 15, 2025Updated last year
prairie-schooner / wav2vec-vc
View on GitHub
☆10Mar 22, 2023Updated 3 years ago
anas-rz / specmix-pytorch
View on GitHub
A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features
☆10Oct 5, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
BUTSpeechFIT / DVBx
View on GitHub
Discriminative Training of VBx Diarization
☆28Sep 23, 2024Updated last year
bootphon / learnable-strf
View on GitHub
Learnable STRF, from Riad et al. 2021 JASA
☆13Aug 21, 2021Updated 4 years ago
nils-werner / pymushra
View on GitHub
pyMUSHRA is a python web application which hosts webMUSHRA experiments and collects the data with python.
☆47Jul 3, 2026Updated 3 weeks ago
FLwolfy / InnoEngine
View on GitHub
A simple cross-platform game engine based on .NET framework.
☆16Jul 17, 2026Updated last week
p1an-lin-jung / wv_tts
View on GitHub
☆19Mar 22, 2024Updated 2 years ago
leohuang2013 / pyannote-audio_overlapped-speech-detection_cpp
View on GitHub
C++ version of pyannote audio overlapped speech detection pipeline
☆13Feb 14, 2024Updated 2 years ago
GravityPoet / textream-zh
View on GitHub
一款隐身于 Mac 摄像头下方的智能提词器：专为视频录制、直播与会议设计，帮您保持自然眼神交流。支持苹果自带语音识别与本地 AI 大模型，能随着您的真实语速自动跟踪和滚动文案，彻底告别忘词与手动滑屏的烦恼。
☆19Feb 24, 2026Updated 5 months ago