sarulab-speech/spatial_voice_conversion

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sarulab-speech/spatial_voice_conversion)

sarulab-speech / spatial_voice_conversion

Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals

☆18

Alternatives and similar repositories for spatial_voice_conversion

Users that are interested in spatial_voice_conversion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hieuthi / LlamaPartialSpoof
View on GitHub
A fully and partially fake speech dataset for evaluation
☆15Nov 11, 2025Updated 8 months ago
Takaaki-Saeki / ssl_speech_restoration_v2
View on GitHub
☆17Dec 18, 2023Updated 2 years ago
zjlww / ardit-web
View on GitHub
☆27Aug 2, 2024Updated last year
facebookresearch / SS2_HRTF
View on GitHub
SS2 HRTF Dataset - Reality Labs Research Audio
☆18May 22, 2026Updated 2 months ago
fuba / histree-core
View on GitHub
A command-line tool that provides the core functionality for storing and retrieving shell command history with directory context in SQLit…
☆11Feb 6, 2026Updated 5 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
nttcslab-sp / mamba-diarization
View on GitHub
Official repository for Mamba-based Segmentation Model for Speaker Diarization
☆47May 13, 2025Updated last year
cyhuang-tw / robust-vc
View on GitHub
☆11May 7, 2022Updated 4 years ago
ictnlp / ComSpeech
View on GitHub
Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".
☆27Jul 2, 2024Updated 2 years ago
anupm12 / webgazer.js-calibration
View on GitHub
Collect eye movement data using a webcam(with calibration).
☆10Mar 25, 2021Updated 5 years ago
IsabelMeraner / BotanicalNER
View on GitHub
Named entity recognition for scientific and vernacular plant names
☆14Jan 17, 2023Updated 3 years ago
ichiroc / chatgpt2scratch
View on GitHub
ChatGPT for Scratch
☆18Mar 8, 2026Updated 4 months ago
yukara-ikemiya / Swin-Transformer-1d
View on GitHub
PyTorch implementation of Swin Transformer for 1-dimensional data
☆19Mar 15, 2024Updated 2 years ago
line / WaveTrainerFit
View on GitHub
Official implementation of "Wave-Trainer-Fit: Neural Vocoder with Trainable Prior and Fixed-Point Iteration towards High-Quality Speech G…
☆16Feb 6, 2026Updated 5 months ago
ogunlao / glowtts_stdp
View on GitHub
Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor
☆19Jun 5, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
QianZ423 / MusicRecognition
View on GitHub
This is a song listening and music recognition project based on audio fingerprint algorithm.
☆11Mar 26, 2022Updated 4 years ago
P1ping / TokAN-Legacy
View on GitHub
☆27Jun 22, 2026Updated last month
tinkoff-ai / hifi_vc
View on GitHub
☆40Jan 24, 2023Updated 3 years ago
shivammehta25 / BetterFastSpeech2
View on GitHub
Just another FastSpeech 2 but cleaner code :)
☆29Jun 28, 2024Updated 2 years ago
hcy71o / SNAC
View on GitHub
Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…
☆57Aug 7, 2023Updated 2 years ago
cwitkowitz / ss-mpe
View on GitHub
Code for the paper "Toward Fully Self-Supervised Multi-Pitch Estimation".
☆25Sep 27, 2025Updated 9 months ago
zhai-lw / SQCodec
View on GitHub
A lightweight audio codec based on a single quantizer
☆72Aug 15, 2025Updated 11 months ago
Edresson / ZS-TTS-Evaluation
View on GitHub
☆45Sep 19, 2024Updated last year
nwpuaslp / kws_mia
View on GitHub
☆11Apr 20, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
sony / diffiner
View on GitHub
☆68Aug 16, 2023Updated 2 years ago
zjlww / dsp
View on GitHub
Digital Speech Processing in PyTorch.
☆15Aug 12, 2022Updated 3 years ago
csukuangfj / kaldi-hmm-gmm
View on GitHub
☆28Apr 24, 2026Updated 3 months ago
KinWaiCheuk / demucs_lightning
View on GitHub
Demucs Lightning: A PyTorch lightning version of Demucs with Hydra and Tensorboard features
☆85May 3, 2023Updated 3 years ago
Lee-W / TOC-Project-2017
View on GitHub
Template Code for TOC Project 2017
☆10Apr 26, 2017Updated 9 years ago
audiolabs / anechoic-noise
View on GitHub
Generator for anechoic, non-stationary noise signals
☆12Aug 12, 2022Updated 3 years ago
iamanigeeit / present
View on GitHub
☆14Aug 19, 2024Updated last year
voidful / vall-e-encodec
View on GitHub
☆41May 15, 2023Updated 3 years ago
Audio-WestlakeU / OnlineSSL_DPRTF_EG
View on GitHub
☆12Apr 1, 2020Updated 6 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
hanshounsu / d3rm
View on GitHub
☆14Feb 3, 2026Updated 5 months ago
mittalgovind / GOTCHA-Deepfakes
View on GitHub
Official Repository for "GOTCHA: Real-Time Video Deepfake Detection via Challenge-Response"
☆11Jul 8, 2024Updated 2 years ago
e13000 / directional_sparse_filtering
View on GitHub
Directional sparse filtering for blind speech separation
☆11Jun 8, 2021Updated 5 years ago
maverick0122 / QueryByHumming
View on GitHub
A query by humming system based on locality sensitive hashing indexes
☆12May 8, 2014Updated 12 years ago
manmay-nakhashi / TTSizer
View on GitHub
🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨
☆18May 20, 2025Updated last year
YMLLG / SPEECHFAKE
View on GitHub
SpeechFake: A Large-Scale Multilingual Speech Deepfake Dataset Incorporating Cutting-Edge Generation Methods
☆28Aug 13, 2025Updated 11 months ago
atosystem / SSL_Interface
View on GitHub
Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024
☆16Nov 19, 2024Updated last year