Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals
☆18Aug 8, 2024Updated last year
Alternatives and similar repositories for spatial_voice_conversion
Users that are interested in spatial_voice_conversion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A fully and partially fake speech dataset for evaluation☆14Nov 11, 2025Updated 4 months ago
- ☆26Mar 29, 2025Updated 11 months ago
- ☆16Dec 18, 2023Updated 2 years ago
- PyTorch implementation of Swin Transformer for 1-dimensional data☆18Mar 15, 2024Updated 2 years ago
- ☆25Aug 2, 2024Updated last year
- A command-line tool that provides the core functionality for storing and retrieving shell command history with directory context in SQLit…☆10Feb 6, 2026Updated last month
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆47May 13, 2025Updated 10 months ago
- ☆11May 7, 2022Updated 3 years ago
- Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".☆26Jul 2, 2024Updated last year
- ChatGPT for Scratch☆18Mar 8, 2026Updated 2 weeks ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆18Jun 5, 2023Updated 2 years ago
- ☆40Jan 24, 2023Updated 3 years ago
- This is a song listening and music recognition project based on audio fingerprint algorithm.☆11Mar 26, 2022Updated 3 years ago
- Collect eye movement data using a webcam(with calibration).☆10Mar 25, 2021Updated 4 years ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- Named entity recognition for scientific and vernacular plant names☆13Jan 17, 2023Updated 3 years ago
- ☆44Sep 19, 2024Updated last year
- Code for the paper "Toward Fully Self-Supervised Multi-Pitch Estimation".☆23Sep 27, 2025Updated 5 months ago
- ☆11Apr 20, 2020Updated 5 years ago
- Just another FastSpeech 2 but cleaner code :)☆29Jun 28, 2024Updated last year
- ☆67Aug 16, 2023Updated 2 years ago
- Demucs Lightning: A PyTorch lightning version of Demucs with Hydra and Tensorboard features☆84May 3, 2023Updated 2 years ago
- Digital Speech Processing in PyTorch.☆15Aug 12, 2022Updated 3 years ago
- ☆12Jun 17, 2019Updated 6 years ago
- ☆28Oct 7, 2025Updated 5 months ago
- A lightweight audio codec based on a single quantizer☆69Aug 15, 2025Updated 7 months ago
- ☆14Aug 19, 2024Updated last year
- ☆41May 15, 2023Updated 2 years ago
- Generator for anechoic, non-stationary noise signals☆11Aug 12, 2022Updated 3 years ago
- Official implementation of the paper - GD-Retriever: Controllable generative text-music retrieval with diffusion models (Accepted at ISMI…☆17Sep 25, 2025Updated 5 months ago
- Official Repository for "GOTCHA: Real-Time Video Deepfake Detection via Challenge-Response"☆11Jul 8, 2024Updated last year
- ☆11Apr 1, 2020Updated 5 years ago
- ☆12Feb 3, 2026Updated last month
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆17May 20, 2025Updated 10 months ago
- OpenFLAM: Framewise Language Audio Model☆101Jan 14, 2026Updated 2 months ago
- ☆30Aug 12, 2023Updated 2 years ago
- Directional sparse filtering for blind speech separation☆10Jun 8, 2021Updated 4 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year