The official Python library for the Fish Audio API.
☆158Mar 25, 2026Updated this week
Alternatives and similar repositories for fish-audio-python
Users that are interested in fish-audio-python are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 4 months ago
- ☆13Oct 14, 2024Updated last year
- This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models☆35Oct 13, 2024Updated last year
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆36Aug 30, 2025Updated 7 months ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆23Mar 17, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Preprocess Audio for training☆383Mar 2, 2026Updated 3 weeks ago
- Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-Step High-Fidelity Audio Generation☆139Mar 8, 2026Updated 3 weeks ago
- ☆24Mar 11, 2026Updated 2 weeks ago
- Make Kanye sing any song ya want 🎤🔥☆27Apr 25, 2023Updated 2 years ago
- poorman's ar-dit tts☆45Dec 31, 2025Updated 2 months ago
- AI generates conversational podcast for ANY research paper, vividly!☆24Oct 10, 2024Updated last year
- 基于PC-DDSP和nsf-HiFiGAN的声码器☆18Jul 17, 2023Updated 2 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- A playground for experimenting with acoustic echo cancellation using a microphone, speaker, and ONNX.☆13Oct 22, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A set of tools to download your music from Suno.ai with organized filenames and prompts.☆27Jan 11, 2025Updated last year
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios"(AAAI 2026)☆94Jan 31, 2026Updated last month
- ☆15Jul 14, 2020Updated 5 years ago
- 5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs☆57Nov 19, 2025Updated 4 months ago
- ☆18Aug 23, 2024Updated last year
- ☆35Jun 9, 2025Updated 9 months ago
- Brand new TTS solution☆11Dec 7, 2024Updated last year
- speex aec kalman filter☆15Mar 17, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆11Mar 22, 2023Updated 3 years ago
- Python的音频工具☆16Dec 5, 2025Updated 3 months ago
- Generation of musical phrases that receive maximum score according to configurable evaluational criteria.☆13Oct 17, 2023Updated 2 years ago
- ☆25Mar 12, 2022Updated 4 years ago
- ☆81Nov 19, 2022Updated 3 years ago
- Pure C# port of the Pocketsphinx keyword spotter☆13Jan 19, 2020Updated 6 years ago
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆22Feb 26, 2026Updated last month
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆19Jun 6, 2025Updated 9 months ago
- ☆20Mar 17, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆34Jun 25, 2021Updated 4 years ago
- dog-can-sing-song☆54Jan 9, 2026Updated 2 months ago
- A Fish Speech implementation in Rust, with Candle.rs☆110Jun 5, 2025Updated 9 months ago
- speaker-disentangled speech linguistic content quantizer☆24Mar 19, 2025Updated last year
- Open source UTAU editing environment.☆11Mar 23, 2026Updated last week
- Conversational Multimodal Emotion Recognition☆12Dec 7, 2020Updated 5 years ago
- ☆130Mar 2, 2026Updated 3 weeks ago