Evaluation tool used in the BigVSAN paper
☆14Mar 22, 2024Updated last year
Alternatives and similar repositories for bigvsan_eval
Users that are interested in bigvsan_eval are comparing it to the libraries listed below
Sorting:
- Alignment examples for Interspeech 2024☆27Jul 5, 2024Updated last year
- VocalVerse: A powerful vocal evaluation framework powered by the Qwen LLMs☆37Jan 22, 2026Updated last month
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme…☆23Aug 14, 2025Updated 6 months ago
- ☆13Sep 12, 2024Updated last year
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…☆40Sep 18, 2024Updated last year
- ☆21Jul 15, 2024Updated last year
- ☆61Oct 28, 2024Updated last year
- ☆88Nov 1, 2022Updated 3 years ago
- VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling☆96Nov 9, 2024Updated last year
- A lightweight audio codec based on a single quantizer☆69Aug 15, 2025Updated 6 months ago
- A toolkit for any-to-any encoder-decoder voice conversion systems☆84Aug 10, 2023Updated 2 years ago
- Source code of APNet2, a vocoder☆58Nov 23, 2023Updated 2 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆38Jan 6, 2024Updated 2 years ago
- A pitch detection model trained to be robust against noise and reverberation environments.☆27Jan 21, 2025Updated last year
- Implementation of Emo-StarGAN☆46Dec 19, 2023Updated 2 years ago
- Reference-aware automatic speech evaluation toolkit☆180Dec 5, 2024Updated last year
- Codes and MIDI demos of ISMIR 2022 paper: Domain Adversarial Training on Conditional Variational Auto-Encoder for Controllable Music Gene…☆21Mar 28, 2023Updated 2 years ago
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆36Apr 29, 2025Updated 10 months ago
- ☆54Jul 16, 2025Updated 7 months ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- A sequence-to-sequence voice conversion toolkit.☆108Jul 5, 2024Updated last year
- A neural speech codec based on discrete WavLM representations☆24Aug 28, 2024Updated last year
- [Official Implementation] Acoustic Autoregressive Modeling 🔥☆75Aug 24, 2024Updated last year
- Variable Bitrate Residual Vector Quantization for Audio Coding☆51May 1, 2025Updated 10 months ago
- Collection of scripts from mHuBERT-147.☆32Nov 19, 2024Updated last year
- Source code for the EMNLP 2025 paper “DM-Codec: Distilling Multimodal Representations for Speech Tokenization”☆56Jun 1, 2025Updated 9 months ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆11Mar 14, 2025Updated 11 months ago
- ☆11Dec 17, 2025Updated 2 months ago
- Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"☆13Oct 31, 2024Updated last year
- ☆11Mar 22, 2023Updated 2 years ago
- Encode and decode audio samples to/from continuous and discrete compressed representations!☆104Nov 25, 2025Updated 3 months ago
- A Singing Style Conversion Framework Based On Audio Infilling☆33Apr 28, 2025Updated 10 months ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆32Apr 10, 2023Updated 2 years ago
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆48Jan 19, 2026Updated last month
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆36Jun 6, 2024Updated last year
- Spherical residual vector quantization (SRVQ)☆31Aug 25, 2024Updated last year
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆87Dec 20, 2024Updated last year
- Simple tool for speech dataset augmentation for modeling various prosodies.☆14Jan 14, 2021Updated 5 years ago
- ☆62Nov 6, 2023Updated 2 years ago