A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.
☆91Apr 2, 2025Updated last year
Alternatives and similar repositories for simple-speaker-embedding
Users that are interested in simple-speaker-embedding are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple, performant re-implementation of AutoVC☆22Jul 6, 2023Updated 2 years ago
- A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack☆10Feb 19, 2018Updated 8 years ago
- An tensorflow implementation of ghostvlad for speaker recognition☆15May 2, 2019Updated 7 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆50Oct 27, 2022Updated 3 years ago
- Command line tool for forced-alignment of Spanish speech data☆13Dec 31, 2025Updated 5 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Machine learning experiment to perform gender classification from raw audio.☆23Sep 1, 2018Updated 7 years ago
- Speaker embedding (d-vector) trained with GE2E loss☆289Jan 8, 2024Updated 2 years ago
- Training code and trained checkpoints for ASGAN.☆62Dec 27, 2023Updated 2 years ago
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.☆144Jul 8, 2021Updated 4 years ago
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated 2 years ago
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 3 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆26Oct 5, 2022Updated 3 years ago
- ☆19Feb 2, 2023Updated 3 years ago
- 基于FreeVC的歌声转换☆21Dec 16, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆151Aug 22, 2022Updated 3 years ago
- phase reconstruction from magnitude terms of an STFT☆13May 18, 2025Updated last year
- Sing any popular song with your voice☆11Jul 10, 2022Updated 3 years ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆88Dec 20, 2022Updated 3 years ago
- ☆15Apr 16, 2026Updated last month
- ☆30Jul 21, 2022Updated 3 years ago
- Source code of APNet2, a vocoder☆59Nov 23, 2023Updated 2 years ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆160Jul 16, 2022Updated 3 years ago
- Official Code for Assem-VC @ICASSP2022☆269May 16, 2022Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".☆34Apr 26, 2021Updated 5 years ago
- PyTorch implementation of: Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences☆11Jul 18, 2019Updated 6 years ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆126Jun 16, 2022Updated 3 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆321Nov 11, 2020Updated 5 years ago
- Update: Ignore this repo, check out @lucidrains' implementation https://github.com/lucidrains/musiclm-pytorch☆15Jan 27, 2023Updated 3 years ago
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆45Jul 24, 2023Updated 2 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Sep 27, 2023Updated 2 years ago
- Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"☆193Dec 8, 2022Updated 3 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Jun 6, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Griffin-Lim Like Phase Recovery via Alternating Direction Method of Multipliers (Yoshiki Masuyama et al., 2018)☆13Dec 17, 2018Updated 7 years ago
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆32Aug 2, 2025Updated 10 months ago
- The source code for the paper XiaoiceSing2 (interspeech2023)☆49Jan 15, 2024Updated 2 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆43May 9, 2023Updated 3 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆270Jul 25, 2024Updated last year
- Implementation of Emo-StarGAN☆46Dec 19, 2023Updated 2 years ago
- Official implementation of the source-filter HiFiGAN vocoder☆273Jul 29, 2023Updated 2 years ago