erogol / BlaGPTLinks
Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible experimentation and exploration.
☆36Updated this week
Alternatives and similar repositories for BlaGPT
Users that are interested in BlaGPT are comparing it to the libraries listed below
Sorting:
- Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model"☆35Updated 3 months ago
- Implementation of a Light Recurrent Unit in Pytorch☆47Updated 8 months ago
- ☆33Updated this week
- Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"☆51Updated 4 months ago
- Benchmark for evaluating TTS models on complex prosodic, expressiveness, and linguistic challenges.☆40Updated this week
- Implementation of Google's USM speech model in Pytorch☆31Updated 2 months ago
- ☆79Updated 9 months ago
- ☆60Updated last year
- ☆84Updated last year
- small audio language model for reasoning☆64Updated last month
- DPO, but faster 🚀☆42Updated 6 months ago
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆56Updated last year
- Implementation of the proposed MaskBit from Bytedance AI☆80Updated 6 months ago
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)☆156Updated last month
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆14Updated 11 months ago
- An official implementation of Style-Talker for Spoken Dialogue Generation☆17Updated 4 months ago
- ☆62Updated 10 months ago
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆52Updated 2 months ago
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch☆106Updated 6 months ago
- research impl of Native Sparse Attention (2502.11089)☆54Updated 3 months ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28Updated last month
- AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension☆102Updated 5 months ago
- [ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"☆107Updated 3 weeks ago
- Explorations into adversarial losses on top of autoregressive loss for language modeling☆36Updated 3 months ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆38Updated this week
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated 2 years ago
- Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial…☆40Updated 4 months ago
- Collection of autoregressive model implementation☆85Updated last month
- Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".☆61Updated 10 months ago
- Attempt to make multiple residual streams from Bytedance's Hyper-Connections paper accessible to the public☆83Updated 3 months ago