zzw922cn / TF2_soft_dtw
Custom TensorFlow 2 implementations of the forward and backward computations of the soft-DTW algorithm in batch mode.
☆21 · Jun 7, 2021 · Updated 4 years ago
Alternatives and similar repositories for TF2_soft_dtw
Users who are interested in TF2_soft_dtw are comparing it to the libraries listed below.
- Helper library for constructing generative models (e.g. flow-based models) with TensorFlow 2.x. ☆12 · Feb 16, 2023 · Updated 3 years ago
- Synthesized singing voice demos of WeSinger 2 paper. ☆26 · Feb 20, 2023 · Updated 2 years ago
- Code for the paper "InteL-VAEs: Adding Inductive Biases to Variational Auto-Encoders via Intermediary Latents". ☆18 · Jun 25, 2021 · Updated 4 years ago
- Parallel waveform generation with DiffusionGAN ☆17 · Mar 26, 2022 · Updated 3 years ago
- Google's TPGST reimplementation. ☆34 · Dec 11, 2019 · Updated 6 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling ☆191 · Nov 18, 2021 · Updated 4 years ago
- Official Demo Page for DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer ☆38 · Feb 17, 2025 · Updated 11 months ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network ☆45 · Dec 1, 2021 · Updated 4 years ago
- ☆19 · Feb 28, 2018 · Updated 7 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec. ☆48 · Sep 4, 2023 · Updated 2 years ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling ☆37 · May 25, 2021 · Updated 4 years ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code ☆10 · Mar 8, 2022 · Updated 3 years ago
- ☆25 · Mar 6, 2024 · Updated last year
- MU-GAN: Facial Attribute Editing based on Multi-attention Mechanism ☆12 · Jun 7, 2020 · Updated 5 years ago
- ☆69 · Mar 31, 2021 · Updated 4 years ago
- PyTorch Implementation of ViT-TTS (EMNLP'23) ☆11 · Oct 20, 2023 · Updated 2 years ago
- Simple tool for speech dataset augmentation for modeling various prosodies. ☆14 · Jan 14, 2021 · Updated 5 years ago
- Implementation of the AlignTTS ☆77 · Jul 6, 2023 · Updated 2 years ago
- WaveNet vocoder using TensorFlow ☆26 · Feb 18, 2018 · Updated 7 years ago
- Chinese Text Normalization and Dataset ☆90 · May 14, 2022 · Updated 3 years ago
- ☆16 · Apr 4, 2022 · Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment ☆16 · Apr 13, 2022 · Updated 3 years ago
- Sample for encrypting and decrypting ONNX models using pyca/cryptography ☆16 · Mar 19, 2022 · Updated 3 years ago
- An implementation of soft-DTW divergences. ☆135 · Oct 14, 2021 · Updated 4 years ago
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models" ☆37 · Aug 29, 2023 · Updated 2 years ago
- GPT-style network for phonemization with durations of text ☆68 · Mar 21, 2024 · Updated last year
- A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech ☆11 · Aug 12, 2020 · Updated 5 years ago
- code for "BEAT-ALIGNED SPECTROGRAM-TO-SEQUENCE GENERATION OF RHYTHM-GAME CHARTS" (ISMIR 2023 LBD) ☆18 · Jan 29, 2024 · Updated 2 years ago
- vq-wav2vec inference ☆13 · Dec 13, 2021 · Updated 4 years ago
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model. ☆144 · Jul 8, 2021 · Updated 4 years ago
- RepVgg + HiFiGAN ☆36 · Aug 10, 2022 · Updated 3 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation ☆39 · Jul 16, 2020 · Updated 5 years ago
- Unofficial Pytorch Implementation of WaveGrad2 ☆112 · Aug 18, 2021 · Updated 4 years ago
- PyTorch implementation of the paper "NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity." (NeurIPS 2020) ☆67 · Dec 28, 2020 · Updated 5 years ago
- 2018/2019 TTS framework integrating state of the art open source methods ☆48 · Jul 8, 2019 · Updated 6 years ago
- An open-source conversational speech synthesis platform that can express a robot's emotions and personality ☆108 · Feb 5, 2025 · Updated last year
- [EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers ☆125 · Mar 20, 2025 · Updated 10 months ago
- 60k hours of phoneme-aligned audio from audio books ☆19 · Jul 27, 2024 · Updated last year
- Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using β-VAE" ☆44 · Apr 10, 2023 · Updated 2 years ago