Fast CosyVoice3 inference with tensorRT and tensorRT-LLM
☆73Mar 7, 2026Updated 2 months ago
Alternatives and similar repositories for FastCosyVoice
Users that are interested in FastCosyVoice are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Boost your efficiency with Fish Speech Batch Inference. Easily process multiple texts and achieve consistently great results. 🗨️🐟☆26Aug 4, 2025Updated 9 months ago
- SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)☆71Dec 23, 2025Updated 5 months ago
- ☆23Oct 30, 2024Updated last year
- ASR on WS, POST/GET FAST_API Can use many RU asr models.☆19May 12, 2026Updated 2 weeks ago
- Adaptive Multimodal Reasoning via Reinforcement Learning☆23Jan 11, 2026Updated 4 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆31Apr 22, 2024Updated 2 years ago
- Lyra V2 (SoundStream) running in the browser☆19Sep 20, 2023Updated 2 years ago
- A blender geometry node and material setup to procedurally generate islamic mosque dome☆17Oct 8, 2023Updated 2 years ago
- AI-driven storytelling system☆11Apr 24, 2025Updated last year
- A text normalization framework using GBM and human-generated features☆10Feb 4, 2020Updated 6 years ago
- UTokyo-SaruLab MOS Prediction System☆318Apr 2, 2026Updated last month
- A streaming audio reader, processor, and writer built on top of soundfile, and PyAV (bindings for FFmpeg)☆38Mar 31, 2026Updated last month
- ICASSP2026 HumDial Challenge☆45Dec 13, 2025Updated 5 months ago
- CDSW/CML version of FF14☆15Jan 29, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support"☆59Jul 29, 2025Updated 9 months ago
- A higher quality RVC pretrained model to accelerate your training process.☆22Nov 11, 2025Updated 6 months ago
- ☆10Jun 11, 2024Updated last year
- A simple extension that allows LLM to speak in any voice, literally, based on Sliero TTS which is available in oobabooga's textgen-webui …☆12Aug 26, 2023Updated 2 years ago
- Generate point arrays for Geometry Nodes using cubic grid, golden angle (Fermat's spiral), poisson disc sampling, or import points from d…☆23Jul 16, 2024Updated last year
- training models at home☆42Apr 16, 2026Updated last month
- Export shortcuts for specific production pipelines. Includes presets for Unity 3D (FBX), ThreeJS (compressed GLB), Element3D (OBJ), Xcode…☆20Jul 16, 2024Updated last year
- Retrieval-based-Voice-Conversion ( RVC ) modified and enhanced by codename;0☆13Jul 8, 2024Updated last year
- ☆12Mar 11, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Modified version of the PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)☆26Apr 10, 2026Updated last month
- Texturaizer ComfyUI Nodes for Blender Plugin Connection☆21Dec 15, 2025Updated 5 months ago
- Multilingual-Speech-Synthesis-Voice-Conversion Using Bark + RVC☆14Apr 19, 2025Updated last year
- An Interactive Infinite Story Generation Framework Based on Multi-Agent☆24Aug 29, 2025Updated 8 months ago
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆54Dec 23, 2024Updated last year
- ☆11Dec 11, 2024Updated last year
- We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction☆207Apr 7, 2026Updated last month
- ☆15Feb 6, 2026Updated 3 months ago
- audiobook GUI for chatterbox☆39Jul 26, 2025Updated 10 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆21Nov 19, 2024Updated last year
- Official repository for the paper - SLAP: Siamese Language-Audio Pretraining without negative samples for Music Understanding☆59Sep 25, 2025Updated 8 months ago
- Audio Speech Segmentation Tool for RVC☆15May 15, 2023Updated 3 years ago
- Dự án công cụ chuyển đổi giọng nói dành cho người Việt☆27May 14, 2026Updated last week
- ☆109Feb 28, 2026Updated 2 months ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆11Dec 13, 2023Updated 2 years ago
- Dockerized Voicecraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild☆18May 15, 2024Updated 2 years ago