SELMA-project / ml4audio
audio, NLP, ML with huggingface, nvidia/nemo, speechbrain
☆10Updated last year
Related projects ⓘ
Alternatives and complementary repositories for ml4audio
- ☆16Updated 5 years ago
- ☆23Updated last year
- NEAL (Nature+Energy Audio Labeller) is an open-source interactive audio data annotation tool.☆13Updated 2 months ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆16Updated last week
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing☆16Updated 2 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- The official implementation of DMEL the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer …☆17Updated 5 months ago
- 🎹 pyannote + 🗒 notebook = pyannotebook☆25Updated last year
- Lyra V2 (SoundStream) running in the browser☆18Updated last year
- ☆22Updated 3 years ago
- ☆32Updated 2 years ago
- GroupMap: beyond mean and variance matching for deep learning☆10Updated 2 years ago
- Supervoice Speaker Separation Network☆13Updated 5 months ago
- A speech signal processing library in Python with emphasis on deep learning.☆31Updated 2 years ago
- Audio processing using deep neural networks. Speaker identification using voice embeddings.☆12Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆18Updated 8 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- GPT for FACodec☆13Updated 7 months ago
- Self-supervised neural network for music recommendations.☆18Updated last year
- Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP]☆21Updated 6 months ago
- Tutorial covering Open Source tools for Source Separation.☆15Updated 2 years ago
- [ICASSP 2023] Tempo vs. Pitch: understanding self-supervised tempo estimation☆13Updated last year
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆15Updated 2 weeks ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆28Updated 6 months ago
- Minimal module for computing audio spectrograms☆15Updated 5 years ago
- Supervoice diffusion enhance☆25Updated 3 months ago
- ☆41Updated last year
- A Differentiable Acoustic Guitar Model for String-Specific Polyphonic Synthesis☆12Updated 11 months ago
- ☆11Updated 5 years ago
- Gentle and praatio scripts for easy forced alignment☆18Updated 2 years ago