Fast CosyVoice3 inference with tensorRT and tensorRT-LLM
☆59Mar 7, 2026Updated last month
Alternatives and similar repositories for FastCosyVoice
Users that are interested in FastCosyVoice are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Boost your efficiency with Fish Speech Batch Inference. Easily process multiple texts and achieve consistently great results. 🗨️🐟☆26Aug 4, 2025Updated 8 months ago
- SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)☆71Dec 23, 2025Updated 3 months ago
- Adaptive Multimodal Reasoning via Reinforcement Learning☆23Jan 11, 2026Updated 3 months ago
- ☆30Apr 22, 2024Updated last year
- Lyra V2 (SoundStream) running in the browser☆19Sep 20, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A Cinematic audio dubbing, Cloning and voice generation studio☆86Updated this week
- simple and fast wav2lip using onnx models for face-detection and inference. Easy installation☆29Oct 14, 2024Updated last year
- A text normalization framework using GBM and human-generated features☆10Feb 4, 2020Updated 6 years ago
- A streaming audio reader, processor, and writer built on top of soundfile, and PyAV (bindings for FFmpeg)☆38Mar 31, 2026Updated 2 weeks ago
- ICASSP2026 HumDial Challenge☆38Dec 13, 2025Updated 4 months ago
- Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support"☆54Jul 29, 2025Updated 8 months ago
- A higher quality RVC pretrained model to accelerate your training process.☆21Nov 11, 2025Updated 5 months ago
- Blender addon for importing and exporting Hedgehog Engine 3D related file formats☆12Mar 19, 2026Updated 3 weeks ago
- ☆17Dec 23, 2025Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Codebase for 'Scaling Rich Style-Prompted Text-to-Speech Datasets'☆158Mar 26, 2026Updated 3 weeks ago
- Github Google Sina QQ four platforms OAuth Sign☆10Feb 25, 2015Updated 11 years ago
- ☆12Mar 11, 2025Updated last year
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆54Dec 23, 2024Updated last year
- We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction☆206Apr 7, 2026Updated last week
- ☆15Feb 6, 2026Updated 2 months ago
- ☆19Oct 10, 2025Updated 6 months ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆21Nov 19, 2024Updated last year
- Official repository for the paper - SLAP: Siamese Language-Audio Pretraining without negative samples for Music Understanding☆58Sep 25, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Audio Speech Segmentation Tool for RVC☆15May 15, 2023Updated 2 years ago
- Dự án công cụ chuyển đổi giọng nói dành cho người Việt☆28Mar 21, 2026Updated 3 weeks ago
- ☆109Feb 28, 2026Updated last month
- Dockerized Voicecraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild☆18May 15, 2024Updated last year
- This repository contains the dataset used to train the neural network model descried in the paper "Implicit HRTF Modeling Using Tempora…☆11Aug 4, 2023Updated 2 years ago
- UI UX retouch of Azahar Plus 3DS emulator☆23Sep 7, 2025Updated 7 months ago
- Simple, Unified Repository for Retrieval-based Voice Conversion☆16Jul 3, 2024Updated last year
- A monorepo containing the web application, documentation, and API used by Dione.☆21Apr 1, 2026Updated 2 weeks ago
- The mobile app for EmuReady. A community-driven platform for tracking emulation compatibility across different devices and emulators.☆41Aug 22, 2025Updated 7 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆60Dec 17, 2025Updated 3 months ago
- My version of the RVC V2 Disconnected Colab notebook, which allows you to use RVC without using WebUI/Gradio☆15Jun 11, 2024Updated last year
- UTokyo-SaruLab MOS Prediction System☆308Apr 2, 2026Updated last week
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆61Sep 5, 2025Updated 7 months ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆23Mar 17, 2026Updated 3 weeks ago
- Official implemtation of UniverSR (ICASSP 2026)☆42Apr 9, 2026Updated last week
- [ACL 2026 Main] Open-Ended Speaking Style Modeling via Fine-Grained and Multi-Granular Contrastive Language-Speech Pre-training☆71Apr 6, 2026Updated last week