resemble-ai / Perth
Open Audio Watermarking Tool
☆246 Updated 2 months ago
Alternatives and similar repositories for Perth
Users who are interested in Perth are comparing it to the libraries listed below.
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration ☆182 Updated 4 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆201 Updated 4 months ago
- ☆287 Updated last month
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate ☆276 Updated 2 months ago
- Collection of Open Source Speech Data ☆159 Updated 9 months ago
- G2P ☆308 Updated 2 weeks ago
- Kyutai with an "eye" ☆215 Updated 5 months ago
- ☆514 Updated last week
- A simple, hackable text-to-speech system in PyTorch and MLX ☆172 Updated 3 weeks ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨ ☆119 Updated 2 weeks ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆67 Updated last month
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On… ☆215 Updated 3 months ago
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer ☆307 Updated last month
- ☆275 Updated last month
- Open source inference code for Rev's model ☆424 Updated 4 months ago
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis ☆605 Updated 4 months ago
- ☆633 Updated 3 weeks ago
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM ☆271 Updated 3 months ago
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition" ☆196 Updated 6 months ago
- On-device streaming text-to-speech engine powered by deep learning ☆120 Updated 3 weeks ago
- python bindings for symphonia/opus - read various audio formats from python and write opus files ☆66 Updated last month
- A lightweight end-to-end text-to-speech model ☆118 Updated 6 months ago
- ☆273 Updated last year
- ☆86 Updated last week
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC) ☆304 Updated 4 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆119 Updated last month
- Open TTS models, built for streaming on the edge ☆43 Updated 5 months ago
- ☆99 Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆159 Updated last year
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model. ☆42 Updated last month