neonbjb / pyfastmp3decoderLinks
A fast MP3 decoder for python, using minimp3
β29Updated 3 years ago
Alternatives and similar repositories for pyfastmp3decoder
Users that are interested in pyfastmp3decoder are comparing it to the libraries listed below
Sorting:
- Real-time end-to-end singing voice convertionβ22Updated last year
- ποΈ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets β¨β128Updated 3 months ago
- SOTA Piano Transformer model trained on 4.2GB of Solo Piano MIDI musicβ27Updated 2 years ago
- StyleTTS 2 Optimized Training Forkβ34Updated 9 months ago
- β107Updated 2 years ago
- Create training data for training a voice cloner for bark text to speech.β47Updated 2 years ago
- β51Updated last year
- Trying to build an all in one speech-text language model - a bit like GPT-4oβ22Updated last year
- β18Updated 3 years ago
- Heteronym to Phoneme Parserβ18Updated 2 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with theβ¦β47Updated 2 years ago
- Audio bandwidth enhancement with DNNs, addressing filter overfittingβ41Updated 2 years ago
- [DEPRECIATED] [PyTorch 2.0] [638M] [85.33% acc] Full-attention multi-instrumental music transformer for supervised music generation, optiβ¦β32Updated 2 years ago
- text-to-audio-latent-diffusionβ37Updated 2 years ago
- An unofficial PyTorch implementation of VALL-Eβ88Updated 3 months ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.β32Updated 2 years ago
- Voice swapping with VQ-VAE and diffusion modelsβ67Updated 4 years ago
- Your one-stop solution for voice dataset creationβ127Updated last year
- β62Updated last year
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentationβ12Updated 11 months ago
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformersβ57Updated 6 months ago
- β11Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.β68Updated last month
- Finally, some decent sample sentencesβ23Updated last year
- [Last Updated 2021] TTS from Cookie. Messy and experimental!β43Updated 2 years ago
- Voxtral: Convert Mistral into a end2end SpeechLM. No information bottleneck, preserves prosody, learns interruptions from data. Unlike GPβ¦β36Updated 8 months ago
- Pytorch implementation of SoundCTMβ101Updated 7 months ago
- audiolm-pytorch training codeβ15Updated 2 years ago
- Open TTS models, built for streaming on the edgeβ44Updated 8 months ago
- RTVC: Real-Time Voice Conversion GUIβ56Updated 2 years ago