ex3ndr / datasetsLinks
Declare your datasets and download them using a simple tool
☆14Updated last year
Alternatives and similar repositories for datasets
Users that are interested in datasets are comparing it to the libraries listed below
Sorting:
- SpeechFlow neural network implementation☆22Updated last year
- Supervoice diffusion enhance☆28Updated last year
- VoiceBox neural network implementation☆110Updated last year
- A library for making PyTorch models streamable☆57Updated 2 weeks ago
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆37Updated 7 months ago
- ⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.☆35Updated 2 years ago
- GPT-style network for phonemization with durations of text☆68Updated last year
- Acoustic Neighbor Embeddings☆29Updated 6 months ago
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆48Updated 3 months ago
- logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source…☆45Updated last week
- VALL-E 2 reproduction☆134Updated last year
- StyleTTS2 + Vocos as a Decoder☆13Updated 10 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 8 months ago
- ☆30Updated 2 weeks ago
- A simple, hackable text-to-speech system in PyTorch and MLX☆187Updated 6 months ago
- ☆86Updated last year
- ☆18Updated 10 months ago
- python bindings for symphonia/opus - read various audio formats from python and write opus files☆77Updated last month
- This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model.☆22Updated 2 years ago
- AudioSR-Upsampling (any -> 48kHz)☆42Updated last year
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆21Updated last year
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Updated last year
- JAX Implementations of Descript Audio Codec and EnCodec☆33Updated 10 months ago
- SDX23 startkit for the Demucs baselines.☆30Updated 2 years ago
- Official repository of Wavehax vocoder☆66Updated last month
- Official Implementation of EnCLAP (ICASSP 2024)☆94Updated last year
- ☆23Updated 2 years ago
- ☆29Updated last year
- ☆58Updated last year
- Code for the blog "Neural audio codecs: how to get audio into LLMs"☆149Updated 3 months ago