SELMA-project / ml4audioLinks
audio, NLP, ML with huggingface, nvidia/nemo, speechbrain
☆11Updated last year
Alternatives and similar repositories for ml4audio
Users that are interested in ml4audio are comparing it to the libraries listed below
Sorting:
- Audio processing using deep neural networks. Speaker identification using voice embeddings.☆13Updated 2 years ago
- ☆23Updated 2 years ago
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated 9 months ago
- Easily turn large sets of audio urls to an audio dataset.☆21Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆29Updated 2 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Self-supervised neural network for music recommendations.☆18Updated 2 years ago
- ☆16Updated 4 months ago
- A repo with scripts to test and play around with Facebook's recent llama models! 🤗☆28Updated last year
- A lightweight Python library for running TTS models with a unified API.☆20Updated 4 months ago
- OCTRA is a web-application for the orthographic transcription of audio files.☆39Updated this week
- A 🔥 cookiecutter template for building Hugging Face Spaces☆11Updated 3 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆33Updated last year
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Updated 2 years ago
- Audio tokenization, in the fastest way possible!☆52Updated 10 months ago
- GPT for FACodec☆13Updated last year
- Supervoice Speaker Separation Network☆12Updated last year
- Lyra V2 (SoundStream) running in the browser☆19Updated last year
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆28Updated this week
- Experiments with Hugging Face 🔬 🤗☆44Updated 10 months ago
- Rust bindings for CTranslate2☆14Updated 2 years ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆21Updated 8 months ago
- Experiments with generating GPT-2 fanfiction on specified topics.☆11Updated 6 years ago
- ☆11Updated 10 years ago
- ☆15Updated 2 years ago
- ☆16Updated 4 months ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆36Updated 2 years ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆18Updated 2 weeks ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆19Updated last year