neonbjb / pyfastmp3decoder
A fast MP3 decoder for python, using minimp3
☆28Updated 2 years ago
Alternatives and similar repositories for pyfastmp3decoder:
Users that are interested in pyfastmp3decoder are comparing it to the libraries listed below
- Misc. tools/scripts that I made to use for tortoise☆21Updated 8 months ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆14Updated last week
- StyleTTS 2 Optimized Training Fork☆27Updated 3 months ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- zero-shot realtime TTS system, fully offline, free and open source☆34Updated 2 weeks ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- ☆10Updated 5 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated 3 weeks ago
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- Finally, some decent sample sentences☆22Updated last year
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆16Updated last month
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆21Updated 3 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Updated last year
- Real-time end-to-end singing voice convertion☆21Updated 6 months ago
- Simple PyTorch Denoisers for Waveform Audio☆35Updated 2 weeks ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 6 months ago
- ☆41Updated 6 months ago
- High quality text-to-speech based on StyleTTS 2.☆37Updated this week
- AudioLDM text to audio colab☆19Updated last year
- A simple voice conversion tool☆17Updated 3 years ago
- ☆107Updated last year
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆12Updated 7 months ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- AudioSR-Upsampling (any -> 48kHz)☆40Updated last year
- Speech enhancement in noisy and reverberant environments using deep neural networks☆20Updated last month
- Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support"☆30Updated 9 months ago
- SOTA Piano Transformer model trained on 4.2GB of Solo Piano MIDI music☆25Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆23Updated this week
- Text prompt steered synthetic audio generators☆46Updated 3 weeks ago
- My vocoder experiments☆28Updated 6 months ago