npuichigo / blazing-fast-io-tutorial
Blazing fast data loading with HuggingFace Dataset and Ray Data
☆16Updated last year
Alternatives and similar repositories for blazing-fast-io-tutorial:
Users that are interested in blazing-fast-io-tutorial are comparing it to the libraries listed below
- Implementation of a Light Recurrent Unit in Pytorch☆46Updated 6 months ago
- An implementation of the Llama architecture, to instruct and delight☆21Updated 3 months ago
- GPT for FACodec☆13Updated last year
- ☆28Updated 2 months ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆86Updated 6 months ago
- The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George…☆20Updated 6 months ago
- Temporary anonymous version☆22Updated last year
- python bindings for symphonia/opus - read various audio formats from python and write opus files☆58Updated last week
- Audio tokenization, in the fastest way possible!☆51Updated 8 months ago
- ☆23Updated last year
- Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model"☆34Updated 2 months ago
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆57Updated 3 weeks ago
- A native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp/pp.☆62Updated 2 weeks ago
- ☆21Updated 2 months ago
- A dashboard for exploring timm learning rate schedulers☆19Updated 5 months ago
- ☆24Updated 4 months ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆27Updated 9 months ago
- ☆59Updated last year
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆25Updated 4 months ago
- Framework for writing deep learning training loops. Lightweight, and retaining full freedom to design as you see fits. It handles checkpo…☆112Updated last year
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated last year
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Updated 2 years ago
- trying to reproduce suno v3☆33Updated 3 months ago
- DPO, but faster 🚀☆41Updated 5 months ago
- Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible…☆25Updated last month
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆52Updated last week
- Contrastive Language-Audio Pretraining☆15Updated 3 years ago
- PyTorch Implementation of ViT-TTS (EMNLP'23)☆10Updated last year
- E2E TTS using Conditional Flow Matching (Experimental*)☆69Updated last year
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆35Updated 2 years ago