Fast CosyVoice3 inference with tensorRT and tensorRT-LLM
☆73Mar 7, 2026Updated 3 months ago
Alternatives and similar repositories for FastCosyVoice
Users that are interested in FastCosyVoice are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Boost your efficiency with Fish Speech Batch Inference. Easily process multiple texts and achieve consistently great results. 🗨️🐟☆26Aug 4, 2025Updated 10 months ago
- SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)☆73Dec 23, 2025Updated 5 months ago
- ☆23Oct 30, 2024Updated last year
- Open-source text-to-speech model from KRAFTON trained exclusively on public speech data, with curated datasets and reproducible training …☆69May 21, 2026Updated 3 weeks ago
- Adaptive Multimodal Reasoning via Reinforcement Learning☆23Jan 11, 2026Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆31Apr 22, 2024Updated 2 years ago
- Lyra V2 (SoundStream) running in the browser☆19Sep 20, 2023Updated 2 years ago
- A text normalization framework using GBM and human-generated features☆10Feb 4, 2020Updated 6 years ago
- Drop a video. Get perfect captions. Fast.☆46Nov 4, 2025Updated 7 months ago
- Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support"☆60Jul 29, 2025Updated 10 months ago
- ☆10Jun 11, 2024Updated 2 years ago
- A simple extension that allows LLM to speak in any voice, literally, based on Sliero TTS which is available in oobabooga's textgen-webui …☆12Aug 26, 2023Updated 2 years ago
- ☆19Dec 23, 2025Updated 5 months ago
- Manifest Dumper is a GUI tool that creates game file's for SteamTools.☆13May 8, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Codebase for 'Scaling Rich Style-Prompted Text-to-Speech Datasets'☆161Mar 26, 2026Updated 2 months ago
- ☆12Mar 11, 2025Updated last year
- Multilingual-Speech-Synthesis-Voice-Conversion Using Bark + RVC☆14Apr 19, 2025Updated last year
- ☆11Dec 11, 2024Updated last year
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆54Dec 23, 2024Updated last year
- We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction☆207Jun 8, 2026Updated last week
- ☆202Updated this week
- ☆15Feb 6, 2026Updated 4 months ago
- Modified version of the PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)☆27May 30, 2026Updated 2 weeks ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆19Oct 10, 2025Updated 8 months ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆21Nov 19, 2024Updated last year
- Official repository for the paper - SLAP: Siamese Language-Audio Pretraining without negative samples for Music Understanding☆59Sep 25, 2025Updated 8 months ago
- Audio Speech Segmentation Tool for RVC☆15May 15, 2023Updated 3 years ago
- Dự án công cụ chuyển đổi giọng nói dành cho người Việt☆30Updated this week
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆11Dec 13, 2023Updated 2 years ago
- ☆109Feb 28, 2026Updated 3 months ago
- This repository contains the dataset used to train the neural network model descried in the paper "Implicit HRTF Modeling Using Tempora…☆11Aug 4, 2023Updated 2 years ago
- UI UX retouch of Azahar Plus 3DS emulator☆26May 12, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Simple, Unified Repository for Retrieval-based Voice Conversion☆16Jul 3, 2024Updated last year
- ☆60Dec 17, 2025Updated 5 months ago
- My version of the RVC V2 Disconnected Colab notebook, which allows you to use RVC without using WebUI/Gradio☆15Jun 11, 2024Updated 2 years ago
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆61Sep 5, 2025Updated 9 months ago
- Official implemtation of UniverSR (ICASSP 2026)☆50Apr 9, 2026Updated 2 months ago
- Fast and memory-efficient exact attention☆21Apr 10, 2026Updated 2 months ago
- [ACL 2026 Main] Open-Ended Speaking Style Modeling via Fine-Grained and Multi-Granular Contrastive Language-Speech Pre-training☆104Apr 6, 2026Updated 2 months ago