SELMA-project / ml4audio
audio, NLP, ML with huggingface, nvidia/nemo, speechbrain
☆9Updated last year
Alternatives and similar repositories for ml4audio:
Users that are interested in ml4audio are comparing it to the libraries listed below
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- StyleTTS 2 Optimized Training Fork☆15Updated this week
- ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level…☆10Updated last month
- ☆23Updated last year
- Audio processing using deep neural networks. Speaker identification using voice embeddings.☆13Updated 2 years ago
- Self-supervised neural network for music recommendations.☆18Updated last year
- Zero-Shot Foreign Accent Conversion without a Native Reference☆28Updated 8 months ago
- Supervoice Speaker Separation Network☆12Updated 7 months ago
- ☆22Updated 3 years ago
- GPT for FACodec☆13Updated 9 months ago
- Easily turn large sets of audio urls to an audio dataset.☆20Updated 2 years ago
- Generate accompaniment part with chords using Evolutionary algorithm.☆8Updated 2 years ago
- Fast and differentiable hidden Markov model in C++☆16Updated last year
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆43Updated 5 months ago
- Audio tokenization, in the fastest way possible!☆46Updated 4 months ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆16Updated this week
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆12Updated last year
- A Differentiable Acoustic Guitar Model for String-Specific Polyphonic Synthesis☆12Updated last year
- ☆32Updated 3 years ago
- ☆14Updated last year
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- ☆16Updated 5 years ago
- My vocoder experiments☆25Updated 3 months ago
- Audio Demo for "FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation"☆19Updated 3 years ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python☆18Updated last year
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆30Updated last year
- ☆15Updated 2 years ago
- Simple PyTorch Denoisers for Waveform Audio☆34Updated last month
- ☆41Updated last year