☆34Dec 25, 2023Updated 2 years ago
Alternatives and similar repositories for wesper-demo
Users that are interested in wesper-demo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Whisper to Normal Speech Conversion with SC-MelGAN and SC-VQ-VAE☆15Dec 3, 2022Updated 3 years ago
- Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"☆13Oct 31, 2024Updated last year
- 🤫A Lightweight One-Shot Whisper to Normal Voice Conversion Model Using Distillation of Self-Supervised Features☆23Dec 10, 2025Updated 4 months ago
- FNSE-SBGAN: Far-field Speech Enhancement with Schrödinger Bridge and Generative Adversarial Networks☆18May 12, 2025Updated 11 months ago
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"☆15Nov 14, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- Implementation for WatchYourMouth: Silent Speech Recognition with Depth Sensing presented at CHI 2024☆19Oct 6, 2025Updated 6 months ago
- ☆29Jul 12, 2024Updated last year
- A neural speech codec based on discrete WavLM representations☆26Aug 28, 2024Updated last year
- Generative Adversarial Networks for different impaired speech conversions☆39Jul 6, 2023Updated 2 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- ☆12Nov 7, 2024Updated last year
- Zero-Shot Emotion Style Transfer☆49Apr 23, 2025Updated 11 months ago
- A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code…☆10Dec 25, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Voice Conversion method based on speaker style☆14Aug 7, 2021Updated 4 years ago
- A wearable non-invasive silent-speech interface with auditory feedback via bone conduction, for assistive and intelligence augmentation☆14Sep 4, 2019Updated 6 years ago
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Dec 5, 2023Updated 2 years ago
- SmartShot automates the task of organizing and renaming screenshots in a specified folder on macOS. It uses the built-in Automater app to…☆12Nov 13, 2023Updated 2 years ago
- End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions☆94Nov 6, 2023Updated 2 years ago
- A real-time voice conversion model based on VITS.☆15Aug 1, 2024Updated last year
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Ai…☆26Oct 9, 2024Updated last year
- Multi-Phase Gammatone Filterbank (MP-GTF) construction for Python☆48Apr 30, 2020Updated 5 years ago
- 44100Hz日本語HuBERTに対応した QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion です。☆16May 21, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Official Implementation of SERIL in Pytorch☆27Sep 29, 2020Updated 5 years ago
- ☆10Jul 1, 2023Updated 2 years ago
- Feed-forward compressor experiments source code for "Differentiable All-pole Filters for Time-varying Audio Systems".☆22Jun 10, 2024Updated last year
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- Blind Source Separation and Dereverberation☆20Mar 26, 2021Updated 5 years ago
- Code for paper "Unsupervised Noise adaptation using Data Simulation"☆14May 16, 2024Updated last year
- ☆13Sep 12, 2024Updated last year
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme…☆24Aug 14, 2025Updated 8 months ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement☆10Jan 24, 2022Updated 4 years ago
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆10Dec 15, 2022Updated 3 years ago
- ☆38Nov 18, 2025Updated 5 months ago
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.☆12Mar 15, 2025Updated last year
- ☆11Mar 22, 2023Updated 3 years ago
- Discriminative Training of VBx Diarization☆27Sep 23, 2024Updated last year
- A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features☆10Oct 5, 2022Updated 3 years ago