resemble-ai / Perth
Open Audio Watermarking Tool
☆209 Updated last month
Alternatives and similar repositories for Perth
Users that are interested in Perth are comparing it to the libraries listed below
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC) ☆294 Updated 2 months ago
- Delayed Streams Modeling (DSM) is a flexible formulation for streaming, multimodal sequence-to-sequence learning. ☆310 Updated this week
- Kyutai with an "eye" ☆200 Updated 3 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨ ☆90 Updated last month
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration ☆174 Updated 2 months ago
- ☆488 Updated this week
- ☆238 Updated 2 months ago
- TTS support with GGML ☆117 Updated last week
- G2P ☆262 Updated last month
- Streaming and Fine-tuning for Chatterbox TTS ☆109 Updated last week
- Python bindings for symphonia/opus - read various audio formats from Python and write opus files ☆64 Updated last month
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate ☆258 Updated 3 weeks ago
- ☆432 Updated last month
- LlamaVoice is a llama-based large voice generation model, providing inference and training ability. ☆233 Updated 10 months ago
- Kokoro text to speech using JavaScript ☆58 Updated 4 months ago
- A simple, hackable text-to-speech system in PyTorch and MLX ☆166 Updated 4 months ago
- Audio tokenization, in the fastest way possible! ☆52 Updated 10 months ago
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM ☆259 Updated last month
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆179 Updated 2 months ago
- ☆22 Updated this week
- Open TTS models, built for streaming on the edge ☆43 Updated 3 months ago
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer ☆293 Updated 2 weeks ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆104 Updated last month
- ☆577 Updated this week
- ☆97 Updated this week
- A random walk voice style cloning application for Kokoro text to speech ☆99 Updated last week
- Open source inference code for Rev's model ☆407 Updated 2 months ago
- ☆101 Updated 9 months ago
- An implementation of Nvidia's Parakeet models for Apple Silicon using MLX. ☆259 Updated this week
- ☆181 Updated this week