elevenlabs / opuspy
Opus codec support for Python.
☆27 Updated 2 years ago
Alternatives and similar repositories for opuspy
Users who are interested in opuspy are comparing it to the libraries listed below.
- Heteronym to Phoneme Parser ☆18 Updated last year
- Supervoice diffusion enhance ☆26 Updated 10 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆62 Updated last month
- Unofficial implementation of wavenext vocoder ☆46 Updated 8 months ago
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model. ☆27 Updated this week
- Trying to build an all in one speech-text language model - a bit like GPT-4o ☆22 Updated 11 months ago
- AudioSR-Upsampling (any -> 48kHz) ☆40 Updated last year
- StyleTTS 2 Optimized Training Fork ☆28 Updated 3 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆95 Updated 7 months ago
- Adaptive Vocoder for Custom Voice ☆59 Updated 2 years ago
- Acoustic Neighbor Embeddings ☆22 Updated 5 months ago
- A high-resolution implementation of HiFi-GAN Vocoder for Voice Conversion. ☆31 Updated 2 years ago
- Test code disclosure for the research paper "UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model", as a supplementa… ☆20 Updated last year
- A curated list of awesome voice activity detection ☆50 Updated 5 months ago
- A minimalistic automatic speech recognition Streamlit-based web app powered by OpenAI's Whisper "State of the Art" models ☆66 Updated 2 years ago
- Codebase and project page for EDMSound ☆34 Updated last year
- High quality text-to-speech based on StyleTTS 2. ☆42 Updated this week
- A lightweight, efficient variation of the StyleTTS 2 text-to-speech model. ☆16 Updated 2 weeks ago
- VoiceBox neural network implementation ☆107 Updated 9 months ago
- Use quantized versions of Whisper to speed up inference ☆12 Updated 7 months ago
- StyleTTS2 + Vocos as a Decoder ☆11 Updated last month
- 🫠 check your data, before you wreck your model ☆16 Updated 2 years ago
- ☆26 Updated last year
- A fast python library for aligning similar audio snippets passed in as NumPy arrays ☆47 Updated 2 months ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models. ☆24 Updated last year
- Open TTS models, built for streaming on the edge ☆41 Updated 2 months ago
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs ☆16 Updated last year
- ☆59 Updated last year
- Non Parallel Voice Conversion based on VITS ☆24 Updated 2 years ago
- Google's SoundStorm: Efficient Parallel Audio Generation ☆132 Updated last year