vlarine / wav2vecView external linksLinks
vq-wav2vec inference
☆13Dec 13, 2021Updated 4 years ago
Alternatives and similar repositories for wav2vec
Users that are interested in wav2vec are comparing it to the libraries listed below
Sorting:
- Pretrained spoken language classifiers from audio.☆10Jan 21, 2021Updated 5 years ago
- TTS Text Analyzer☆32Jul 20, 2023Updated 2 years ago
- [AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS☆64Nov 18, 2024Updated last year
- ☆64Jan 15, 2024Updated 2 years ago
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆77Dec 3, 2025Updated 2 months ago
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- RepVgg + HiFiGAN☆36Aug 10, 2022Updated 3 years ago
- ☆18Dec 7, 2023Updated 2 years ago
- ☆36Mar 14, 2025Updated 11 months ago
- Basic concatenative text-to-speech implementation in Python☆19Aug 31, 2019Updated 6 years ago
- ESPnet-TTS Audio Sample HP☆21Oct 25, 2019Updated 6 years ago
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS☆24Jan 29, 2022Updated 4 years ago
- A Neural Audio Codec (NAC) for Universal Audio☆44May 30, 2025Updated 8 months ago
- Official code for "Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis"☆107Dec 20, 2025Updated last month
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆28Aug 4, 2023Updated 2 years ago
- Custom TensorFlow2 implementations of forward and backward computation of soft-DTW algorithm in batch mode.☆21Jun 7, 2021Updated 4 years ago
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 2 years ago
- ☆26Aug 8, 2024Updated last year
- ☆99Jan 19, 2026Updated 3 weeks ago
- working on parallel wavenet☆25Apr 19, 2018Updated 7 years ago
- An unofficial implementation of Vector Quantization Voice Conversion (VQVC).☆29Apr 12, 2021Updated 4 years ago
- This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models☆35Oct 13, 2024Updated last year
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆77Jun 9, 2023Updated 2 years ago
- wavenet vocoder using tensorflow☆26Feb 18, 2018Updated 7 years ago
- Tacotron2 with Global Style Tokens☆65Apr 19, 2019Updated 6 years ago
- Neural network-based forced alignment with bidirectional attention mechanism☆78Jan 17, 2025Updated last year
- ☆69May 19, 2023Updated 2 years ago
- The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…☆42Sep 5, 2025Updated 5 months ago
- ☆33Jun 29, 2023Updated 2 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Sep 21, 2022Updated 3 years ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆71Mar 19, 2021Updated 4 years ago
- Training code and dataset cleasing with Sidon☆76Jan 16, 2026Updated 3 weeks ago
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆78Dec 3, 2024Updated last year
- ☆31Jul 13, 2023Updated 2 years ago
- ☆30Aug 12, 2023Updated 2 years ago
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆37Aug 29, 2023Updated 2 years ago
- PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind☆64Sep 22, 2025Updated 4 months ago
- The framework for creating a new platform (like game engine).☆10Jan 11, 2026Updated last month
- Unsupervised Rhythm Modeling for Voice Conversion☆86Aug 3, 2023Updated 2 years ago