npuichigo / blazing-fast-io-tutorial
Blazing fast data loading with HuggingFace Dataset and Ray Data
☆15Updated last year
Alternatives and similar repositories for blazing-fast-io-tutorial:
Users who are interested in blazing-fast-io-tutorial are comparing it to the libraries listed below.
- Implementation of a Light Recurrent Unit in PyTorch☆47Updated 5 months ago
- ☆23Updated last year
- GPT for FACodec☆13Updated 11 months ago
- Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)☆28Updated last year
- Temporary anonymous version☆22Updated 11 months ago
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆50Updated this week
- ☆25Updated 2 weeks ago
- ☆21Updated this week
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆16Updated 7 months ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆83Updated 4 months ago
- The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George…☆20Updated 4 months ago
- ☆59Updated last year
- A dashboard for exploring timm learning rate schedulers☆19Updated 3 months ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated last year
- DPO, but faster 🚀☆40Updated 3 months ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless, and TortoiseTTS into one☆27Updated 7 months ago
- python bindings for symphonia/opus - read various audio formats from python and write opus files☆33Updated this week
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆25Updated 2 months ago
- ☆24Updated 7 months ago
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024☆15Updated 3 months ago
- GPT-style network for phonemization with durations of text☆63Updated 11 months ago
- Torch implementation of Whisper-guided, DDPM-based Voice Conversion☆49Updated 2 years ago
- WarpRNNT loss ported to Numba CPU/CUDA for PyTorch☆16Updated 2 years ago
- Audio tokenization, in the fastest way possible!☆49Updated 6 months ago
- Contrastive Language-Audio Pretraining☆15Updated 3 years ago
- An implementation of the Llama architecture, to instruct and delight☆21Updated last month