resemble-ai / Perth
Open Audio Watermarking Tool
☆237 Updated last month
Alternatives and similar repositories for Perth
Users who are interested in Perth are comparing it to the libraries listed below.
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration ☆178 Updated 3 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆197 Updated 3 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨ ☆97 Updated 2 months ago
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate ☆271 Updated 2 months ago
- ☆272 Updated last month
- On-device streaming text-to-speech engine powered by deep learning ☆102 Updated 2 weeks ago
- ☆512 Updated last month
- Collection of Open Source Speech Data ☆159 Updated 9 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆114 Updated 2 weeks ago
- Kyutai with an "eye" ☆212 Updated 4 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆65 Updated last week
- A simple, hackable text-to-speech system in PyTorch and MLX ☆168 Updated 5 months ago
- A lightweight end-to-end text-to-speech model ☆117 Updated 5 months ago
- python bindings for symphonia/opus - read various audio formats from python and write opus files ☆64 Updated 2 weeks ago
- G2P ☆293 Updated 3 months ago
- ☆27 Updated last month
- Open TTS models, built for streaming on the edge ☆43 Updated 4 months ago
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer ☆308 Updated last month
- ☆628 Updated last week
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC) ☆302 Updated 3 months ago
- Audio tokenization, in the fastest way possible! ☆52 Updated 11 months ago
- ☆198 Updated 2 weeks ago
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM ☆268 Updated 2 months ago
- ☆273 Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆161 Updated last year
- Streaming and Fine-tuning for Chatterbox TTS ☆143 Updated last month
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On… ☆215 Updated 2 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model. ☆42 Updated last week
- VLLM Port of the Chatterbox TTS model ☆156 Updated this week
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation ☆118 Updated 2 months ago