neonbjb / pyfastmp3decoderLinks
A fast MP3 decoder for python, using minimp3
☆29Updated 3 years ago
Alternatives and similar repositories for pyfastmp3decoder
Users that are interested in pyfastmp3decoder are comparing it to the libraries listed below
Sorting:
- ☆107Updated 2 years ago
- StyleTTS 2 Optimized Training Fork☆34Updated 10 months ago
- Real-time end-to-end singing voice convertion☆22Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆69Updated last month
- Easily turn large sets of audio urls to an audio dataset.☆21Updated 2 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆131Updated 2 years ago
- ☆18Updated 3 years ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆32Updated 2 years ago
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆127Updated 2 months ago
- Codebase and project page for EDMSound☆35Updated 2 years ago
- A collection of all our phonemeizers for dataset construction and inference☆27Updated 9 months ago
- SOTA Piano Transformer model trained on 4.2GB of Solo Piano MIDI music☆27Updated 2 years ago
- Audio bandwidth enhancement with DNNs, addressing filter overfitting☆41Updated 2 years ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆129Updated 4 months ago
- [Last Updated 2021] TTS from Cookie. Messy and experimental!☆43Updated 2 years ago
- zero-shot realtime TTS system, fully offline, free and open source☆48Updated 7 months ago
- Create training data for training a voice cloner for bark text to speech.☆48Updated 2 years ago
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated last year
- ☆11Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆103Updated last year
- The demo page of UniAudio☆34Updated last year
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆27Updated last year
- Finally, some decent sample sentences☆23Updated 2 years ago
- Demo for 2022 ICASSP☆64Updated 3 years ago
- VoiceBox neural network implementation☆110Updated last year
- Voxtral: Convert Mistral into a end2end SpeechLM. No information bottleneck, preserves prosody, learns interruptions from data. Unlike GP…☆36Updated 9 months ago
- Opus codec support for Python.☆31Updated 3 years ago
- DLAS - A configuration-driven trainer for generative models☆141Updated 3 years ago
- [DEPRECIATED] [PyTorch 2.0] [638M] [85.33% acc] Full-attention multi-instrumental music transformer for supervised music generation, opti…☆32Updated 2 years ago