A speaker embedding network in PyTorch that is very quick to set up and use for any purpose.
☆90Apr 2, 2025Updated 11 months ago
Alternatives and similar repositories for simple-speaker-embedding
Users who are interested in simple-speaker-embedding are comparing it to the libraries listed below
- A simple, performant re-implementation of AutoVC☆22Jul 6, 2023Updated 2 years ago
- LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform☆18May 17, 2024Updated last year
- ☆19Feb 2, 2023Updated 3 years ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- Singing voice conversion based on FreeVC☆21Dec 16, 2022Updated 3 years ago
- Attention Backend for Automatic Speaker Verification with Multiple Enrollment Utterances☆50Oct 27, 2022Updated 3 years ago
- Command line tool for forced-alignment of Spanish speech data☆13Dec 31, 2025Updated 2 months ago
- A deep neural network for finding text-independent speaker embeddings, written in TensorFlow and Tensorpack☆10Feb 19, 2018Updated 8 years ago
- Sing any popular song with your voice☆11Jul 10, 2022Updated 3 years ago
- Machine learning experiment to perform gender classification from raw audio.☆23Sep 1, 2018Updated 7 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆146Aug 22, 2022Updated 3 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present …☆26Oct 5, 2022Updated 3 years ago
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 3 years ago
- Speaker embedding (d-vector) trained with GE2E loss☆286Jan 8, 2024Updated 2 years ago
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆44Jul 24, 2023Updated 2 years ago
- Source code of APNet2, a vocoder☆58Nov 23, 2023Updated 2 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44May 9, 2023Updated 2 years ago
- Update: Ignore this repo, check out @lucidrains' implementation https://github.com/lucidrains/musiclm-pytorch☆15Jan 27, 2023Updated 3 years ago
- A Chinese version of A Neural Parametric Singing Synthesizer☆13Feb 12, 2022Updated 4 years ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆87Dec 20, 2022Updated 3 years ago
- ☆15Aug 22, 2025Updated 6 months ago
- Official Code for Assem-VC @ICASSP2022☆269May 16, 2022Updated 3 years ago
- The source code for the paper XiaoiceSing2 (interspeech2023)☆49Jan 15, 2024Updated 2 years ago
- My vocoder experiments☆31Jul 26, 2025Updated 7 months ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆159Jul 16, 2022Updated 3 years ago
- A TensorFlow implementation of GhostVLAD for speaker recognition☆15May 2, 2019Updated 6 years ago
- source code of EfficientTTS 2☆20Feb 18, 2024Updated 2 years ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆125Jun 16, 2022Updated 3 years ago
- Official implementation of the source-filter HiFiGAN vocoder☆268Jul 29, 2023Updated 2 years ago
- Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.☆140Sep 25, 2024Updated last year
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 8 months ago
- ☆21Feb 15, 2022Updated 4 years ago
- Unsupervised Rhythm Modeling for Voice Conversion☆86Aug 3, 2023Updated 2 years ago
- Voice conversion with just linear regression.☆35Sep 25, 2025Updated 5 months ago
- Real-time end-to-end singing voice conversion☆24Nov 3, 2024Updated last year
- Training code and trained checkpoints for ASGAN.☆62Dec 27, 2023Updated 2 years ago
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.☆144Jul 8, 2021Updated 4 years ago
- ☆22Feb 22, 2024Updated 2 years ago
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Jun 1, 2024Updated last year