npuichigo / blazing-fast-io-tutorialLinks
Blazing fast data loading with HuggingFace Dataset and Ray Data
☆16Updated last year
Alternatives and similar repositories for blazing-fast-io-tutorial
Users that are interested in blazing-fast-io-tutorial are comparing it to the libraries listed below
Sorting:
- Implementation of a Light Recurrent Unit in Pytorch☆48Updated 9 months ago
- Framework for writing deep learning training loops. Lightweight, and retaining full freedom to design as you see fits. It handles checkpo…☆113Updated last year
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆87Updated 9 months ago
- Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible…☆67Updated this week
- ☆23Updated 2 years ago
- ☆52Updated last week
- An implementation of the Llama architecture, to instruct and delight☆21Updated last month
- ☆29Updated last week
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated 2 years ago
- Implementation of 2-simplicial attention proposed by Clift et al. (2019) and the recent attempt to make practical in Fast and Simplex, Ro…☆34Updated this week
- GPT for FACodec☆13Updated last year
- Audio tokenization, in the fastest way possible!☆52Updated 10 months ago
- The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George…☆19Updated 9 months ago
- A dashboard for exploring timm learning rate schedulers☆19Updated 7 months ago
- Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model"☆40Updated 4 months ago
- Temporary anonymous version☆22Updated last year
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024☆15Updated 7 months ago
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆63Updated 3 weeks ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Updated 2 years ago
- The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral)☆46Updated 3 months ago
- python bindings for symphonia/opus - read various audio formats from python and write opus files☆64Updated 2 months ago
- Implementation of Google's USM speech model in Pytorch☆31Updated 3 months ago
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch☆110Updated 7 months ago
- Implementation of Strassen attention, from Kozachinskiy et al. of National Center of AI in Chile☆38Updated last week
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆25Updated 7 months ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆36Updated 2 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆77Updated last year
- A TTS model that makes a speaker speak new languages☆76Updated last year
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆51Updated 4 years ago
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆46Updated last year