SELMA-project / ml4audio
audio, NLP, ML with huggingface, nvidia/nemo, speechbrain
☆9Updated last year
Alternatives and similar repositories for ml4audio:
Users that are interested in ml4audio are comparing it to the libraries listed below
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Self-supervised neural network for music recommendations.☆18Updated last year
- A neural network for filtering target speaker's voice from audio written in tensorflow☆21Updated 6 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆12Updated 3 weeks ago
- Rust bindings for CTranslate2☆14Updated last year
- ☆23Updated last year
- Zero-Shot Foreign Accent Conversion without a Native Reference☆29Updated 9 months ago
- ☆16Updated 5 years ago
- ☆22Updated 3 years ago
- StyleTTS 2 Optimized Training Fork☆22Updated 2 weeks ago
- Fast and differentiable hidden Markov model in C++☆17Updated 2 years ago
- Experiments and tutorials with and for torchaudio☆13Updated 3 years ago
- Tool for the Automatic Assessment of Lexical Diversity☆11Updated 4 years ago
- A speech signal processing library in Python with emphasis on deep learning.☆31Updated 2 years ago
- Audio processing using deep neural networks. Speaker identification using voice embeddings.☆13Updated 2 years ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆18Updated last week
- Contrastive Language-Audio Pretraining☆15Updated 3 years ago
- ☆32Updated 3 years ago
- Gentle and praatio scripts for easy forced alignment☆18Updated 2 years ago
- Generate accompaniment part with chords using Evolutionary algorithm.☆9Updated 2 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- GPT for FACodec☆13Updated 10 months ago
- User simulation for dialog systems using Inverse Reinforcement Learning☆12Updated 8 years ago
- Experiments with generating GPT-2 fanfiction on specified topics.☆11Updated 5 years ago
- Examples of cleaning up raw voices☆18Updated 2 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆34Updated 2 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Updated 3 years ago
- ☆41Updated last year
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆12Updated last year