fishaudio/fish-audio-python

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/fishaudio/fish-audio-python)

fishaudio / fish-audio-python

The official Python library for the Fish Audio API.

☆184

Alternatives and similar repositories for fish-audio-python

Users that are interested in fish-audio-python are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated last year
MuyangDu / T5Voice
View on GitHub
T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …
☆28Nov 7, 2025Updated 8 months ago
camenduru / TANGO-jupyter
View on GitHub
☆13Oct 14, 2024Updated last year
LAION-AI / emotional-speech-annotations
View on GitHub
This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models
☆35Oct 13, 2024Updated last year
primepake / dac_vae
View on GitHub
Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder
☆38Aug 30, 2025Updated 10 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
fishaudio / audio-preprocess
View on GitHub
Preprocess Audio for training
☆395Jun 1, 2026Updated last month
meaningTeam / tidy-tunes
View on GitHub
Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …
☆23May 19, 2026Updated last month
yxduir / LLM-SRT
View on GitHub
☆28Mar 11, 2026Updated 4 months ago
bfs18 / armel
View on GitHub
poorman's ar-dit tts
☆45Dec 31, 2025Updated 6 months ago
k2-fsa / Flow2GAN
View on GitHub
Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-Step High-Fidelity Audio Generation
☆143Mar 8, 2026Updated 4 months ago
nateraw / voice-cloning
View on GitHub
Make Kanye sing any song ya want 🎤🔥
☆26Apr 25, 2023Updated 3 years ago
qiuqiao / DDSP-HiFiGAN
View on GitHub
基于PC-DDSP和nsf-HiFiGAN的声码器
☆19Jul 17, 2023Updated 2 years ago
PINTO0309 / onnx-aec
View on GitHub
A playground for experimenting with acoustic echo cancellation using a microphone, speaker, and ONNX.
☆13Oct 22, 2024Updated last year
danny-englander / suno-ai-downloader
View on GitHub
A set of tools to download your music from Suno.ai with organized filenames and prompts.
☆31Jan 11, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Tencent / StableToken
View on GitHub
[ICLR 2026] StableToken: A state-of-the-art noise-robust semantic speech tokenizer featuring Voting-LFQ for resilient SpeechLLMs.
☆33Feb 27, 2026Updated 4 months ago
phunterlau / papercast
View on GitHub
AI generates conversational podcast for ANY research paper, vividly!
☆25Oct 10, 2024Updated last year
adelacvg / DPTTS
View on GitHub
An AR+AR TTS attempt.
☆18Jan 13, 2025Updated last year
AnyaCoder / fish-speech
View on GitHub
Brand new TTS solution
☆11Dec 7, 2024Updated last year
hbwu-ntu / EmoCtrlTTS-Eval
View on GitHub
☆19Aug 23, 2024Updated last year
KdaiP / DC-Speech-VAE
View on GitHub
5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs
☆57Nov 19, 2025Updated 7 months ago
SparkAudio / SparkVox
View on GitHub
☆37Jun 9, 2025Updated last year
daihuangyu / speex_aec_kf
View on GitHub
speex aec kalman filter
☆15Mar 17, 2024Updated 2 years ago
prairie-schooner / wav2vec-vc
View on GitHub
☆10Mar 22, 2023Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
yeyupiaoling / YeAudio
View on GitHub
Python的音频工具
☆16Dec 5, 2025Updated 7 months ago
gxu82 / MVDR-Speech-Enhancement
View on GitHub
☆16Jul 14, 2020Updated 6 years ago
ShawnPi233 / HQ-SVC
View on GitHub
Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios"(AAAI 2026)
☆106Jun 17, 2026Updated 3 weeks ago
Nikolay-Lysenko / geniartor
View on GitHub
Generation of musical phrases that receive maximum score according to configurable evaluational criteria.
☆12Oct 17, 2023Updated 2 years ago
NeuroWave-ai / CUCVAE-TTS
View on GitHub
☆25Mar 12, 2022Updated 4 years ago
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,259Jun 9, 2026Updated last month
Multi-Singer / Multi-Singer.github.io
View on GitHub
☆83Nov 19, 2022Updated 3 years ago
premake / premake.github.io
View on GitHub
Premake's static website, with landing and download pages.
☆10Jul 3, 2026Updated last week
PoTaTo-Mika / Shore-Data-Engine
View on GitHub
A codebase for data crawling and preprocessing for TTS and ASR systems training.
☆23Jun 13, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
seorim0 / SE-using-SRL-Model
View on GitHub
Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings
☆20Jun 6, 2025Updated last year
tuneflow / tuneflow-py-demos
View on GitHub
☆15May 8, 2023Updated 3 years ago
WangHelin1997 / SpecAugment-plus
View on GitHub
A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification
☆34Jun 25, 2021Updated 5 years ago
HITerltr / HIT-2023-Spring-Compilation-System
View on GitHub
哈尔滨工业大学2023春季学期编译系统课程实验、习题、课件以及期末复习材料
☆12Jul 30, 2023Updated 2 years ago
Lucanyc / VISTA-Gym
View on GitHub
☆27Mar 17, 2026Updated 3 months ago
CarlWangChina / SaMoye-SVC
View on GitHub
dog-can-sing-song
☆58Jan 9, 2026Updated 6 months ago
Plachtaa / ASTRAL-quantization
View on GitHub
speaker-disentangled speech linguistic content quantizer
☆26Mar 19, 2025Updated last year