Akella17 / speaker-embeddingView external linksLinks
A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack
☆10Feb 19, 2018Updated 7 years ago
Alternatives and similar repositories for speaker-embedding
Users that are interested in speaker-embedding are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.☆31Jun 17, 2024Updated last year
- GlottDNN vocoder and tools for training DNN excitation models☆32Feb 27, 2021Updated 4 years ago
- PyTorch based speaker embedding model☆16Apr 13, 2024Updated last year
- Code for paper "Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition"☆19Jun 21, 2023Updated 2 years ago
- Open Source Speech/Text Data on AI☆19Sep 13, 2022Updated 3 years ago
- ☆17Aug 27, 2025Updated 5 months ago
- A Tensorflow Implementation like "Neural Speech Synthesis with Transformer Network" Port From OpenSeq2Seq☆20Jul 6, 2023Updated 2 years ago
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 2 years ago
- ☆24Jul 22, 2019Updated 6 years ago
- 양재 AI 실무자 교육 6조 프로젝트☆21Sep 18, 2018Updated 7 years ago
- Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.☆29Dec 31, 2023Updated 2 years ago
- TTS for pitch-accented language. Korean dialect DB.☆157May 12, 2023Updated 2 years ago
- ☆33Jan 14, 2023Updated 3 years ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆33Sep 2, 2022Updated 3 years ago
- Korean Singing Voice Synthesis based on Auto-regressive Boundary Equilibrium GAN☆67Apr 26, 2021Updated 4 years ago
- Implementation of Global Style Token Tacotron in TensorFlow2☆26Sep 28, 2020Updated 5 years ago
- Pytorch implementation of conformer with with training script for end-to-end speech recognition on the LibriSpeech dataset.☆28May 1, 2024Updated last year
- ☆31Nov 7, 2018Updated 7 years ago
- ☆31Jul 13, 2023Updated 2 years ago
- PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)☆20Apr 11, 2022Updated 3 years ago
- Synthesizer Self-Attention is a very recent alternative to causal self-attention that has potential benefits by removing this dot product…☆14Dec 29, 2024Updated last year
- RepVgg + HiFiGAN☆36Aug 10, 2022Updated 3 years ago
- A simple tool to easily use Montreal Forced Aligner. Also provide alignment(TextGrid) retrieved from ESD.☆45May 25, 2023Updated 2 years ago
- Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021☆40Jul 17, 2021Updated 4 years ago
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification☆14Jul 19, 2022Updated 3 years ago
- 책 읽어주는 딥러닝을 보고 나도 만들고 싶어져서 공부하며 만드는 repository입니다.☆10Dec 8, 2022Updated 3 years ago
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆90Apr 2, 2025Updated 10 months ago
- Interface for Controllable Expressive Talking Machine☆40Sep 20, 2025Updated 4 months ago
- ☆42Oct 30, 2018Updated 7 years ago
- Implementation for NATv2.☆23Feb 20, 2021Updated 4 years ago
- List of papers about TTS / Список статей о TTS☆10Dec 16, 2017Updated 8 years ago
- ☆12Jul 24, 2024Updated last year
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Mar 23, 2021Updated 4 years ago
- Planet wars RTS game for AI agent evaluation☆19Jan 9, 2026Updated last month
- ☆10Sep 19, 2018Updated 7 years ago
- Combining encoder-based language models☆11Nov 11, 2021Updated 4 years ago
- Tutorial session material of Pytest in PyCon KR 2019☆10Apr 11, 2020Updated 5 years ago
- ☆12Jun 5, 2018Updated 7 years ago