☆19May 2, 2024Updated 2 years ago
Alternatives and similar repositories for TacoLM
Users that are interested in TacoLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Aug 19, 2024Updated last year
- ☆40Apr 15, 2024Updated 2 years ago
- speaker-disentangled speech linguistic content quantizer☆25Mar 19, 2025Updated last year
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆70Nov 1, 2024Updated last year
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆23May 19, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆39Oct 1, 2023Updated 2 years ago
- 4G GPU & 10 Minutes for train☆12Aug 9, 2023Updated 2 years ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆92Jul 23, 2025Updated 10 months ago
- My vocoder experiments☆31Jul 26, 2025Updated 10 months ago
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter☆97Jul 4, 2024Updated last year
- [NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching☆125Apr 8, 2026Updated last month
- ☆14Aug 1, 2025Updated 9 months ago
- unofficial pytorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 3 years ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆108Jan 17, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions☆86Oct 11, 2024Updated last year
- ☆70Sep 3, 2024Updated last year
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆27Apr 23, 2024Updated 2 years ago
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆76Dec 3, 2025Updated 5 months ago
- An unofficial PyTorch implementation of VALL-E☆88Aug 3, 2025Updated 9 months ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆40Jul 10, 2023Updated 2 years ago
- A collection of utilities for handling IPA phones.☆27Sep 24, 2023Updated 2 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆25Aug 1, 2025Updated 9 months ago
- Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023☆27Apr 27, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [ACL 2025 Main] ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec☆275Nov 22, 2024Updated last year
- Phonemes and durations labeling based on whisper small☆11Jul 7, 2024Updated last year
- Bilingual Singing Voice Synthesis☆18Mar 25, 2024Updated 2 years ago
- Awesome Triton Resources☆41Apr 27, 2025Updated last year
- ☆15Mar 31, 2025Updated last year
- Zero-Shot Emotion Style Transfer☆49Apr 23, 2025Updated last year
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago
- All generative model in one for better TTS model☆74Sep 8, 2024Updated last year
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆98Oct 8, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Evaluation Protocol for Large-Scale Zero-Shot TTS Literature☆95Mar 12, 2025Updated last year
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆78Nov 1, 2024Updated last year
- ACM MM 2024 FlashSpeech: Efficient Zero-Shot Speech Synthesis☆154Sep 20, 2024Updated last year
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis☆151Jan 1, 2025Updated last year
- MiSS is a novel PEFT method that features a low-rank structure but introduces a new update mechanism distinct from LoRA, achieving an exc…☆35Mar 9, 2026Updated 2 months ago
- The open source code for LLM-Codec☆147Aug 18, 2024Updated last year
- Taiwanese Speech Synthesis with Tacotron2☆26Oct 2, 2022Updated 3 years ago