A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack
☆10Feb 19, 2018Updated 8 years ago
Alternatives and similar repositories for speaker-embedding
Users that are interested in speaker-embedding are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.☆31Jun 17, 2024Updated last year
- GlottDNN vocoder and tools for training DNN excitation models☆32Feb 27, 2021Updated 5 years ago
- PyTorch based speaker embedding model☆16Apr 13, 2024Updated last year
- Open Source Speech/Text Data on AI☆19Sep 13, 2022Updated 3 years ago
- Code for paper "Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition"☆19Jun 21, 2023Updated 2 years ago
- ☆17Aug 27, 2025Updated 6 months ago
- A Tensorflow Implementation like "Neural Speech Synthesis with Transformer Network" Port From OpenSeq2Seq☆20Jul 6, 2023Updated 2 years ago
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 3 years ago
- ☆24Jul 22, 2019Updated 6 years ago
- Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.☆29Dec 31, 2023Updated 2 years ago
- 양재 AI 실무자 교육 6조 프로젝트☆21Sep 18, 2018Updated 7 years ago
- TTS for pitch-accented language. Korean dialect DB.☆157May 12, 2023Updated 2 years ago
- ☆33Jan 14, 2023Updated 3 years ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆33Sep 2, 2022Updated 3 years ago
- Korean Singing Voice Synthesis based on Auto-regressive Boundary Equilibrium GAN☆67Apr 26, 2021Updated 4 years ago
- Implementation of Global Style Token Tacotron in TensorFlow2☆26Sep 28, 2020Updated 5 years ago
- Pytorch implementation of conformer with with training script for end-to-end speech recognition on the LibriSpeech dataset.☆28May 1, 2024Updated last year
- ☆31Nov 7, 2018Updated 7 years ago
- ☆31Jul 13, 2023Updated 2 years ago
- Synthesizer Self-Attention is a very recent alternative to causal self-attention that has potential benefits by removing this dot product…☆14Dec 29, 2024Updated last year
- PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)☆20Apr 11, 2022Updated 3 years ago
- RepVgg + HiFiGAN☆36Aug 10, 2022Updated 3 years ago
- A simple tool to easily use Montreal Forced Aligner. Also provide alignment(TextGrid) retrieved from ESD.☆45May 25, 2023Updated 2 years ago
- 책 읽어주는 딥러닝을 보고 나도 만들고 싶어져서 공부하며 만드는 repository입니다.☆10Dec 8, 2022Updated 3 years ago
- Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021☆40Jul 17, 2021Updated 4 years ago
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification☆14Jul 19, 2022Updated 3 years ago
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆90Apr 2, 2025Updated 11 months ago
- Interface for Controllable Expressive Talking Machine☆40Sep 20, 2025Updated 5 months ago
- ☆42Oct 30, 2018Updated 7 years ago
- Standalone software tool for conducting spatial audio listening tests.☆15Nov 29, 2021Updated 4 years ago
- 9기 운영진을 위한 repo입니다.☆12Sep 22, 2024Updated last year
- Methods to extract tracks from time-frequency distributions; tracks can represent instantaneous frequency (IF) laws☆10May 11, 2016Updated 9 years ago
- Implementation for NATv2.