Red-Hat-AI-Innovation-Team / SQuatLinks
☆13Updated 3 weeks ago
Alternatives and similar repositories for SQuat
Users that are interested in SQuat are comparing it to the libraries listed below
Sorting:
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆32Updated 3 months ago
- Unofficial Implementation of Selective Attention Transformer☆17Updated 7 months ago
- ☆79Updated 10 months ago
- This repo is based on https://github.com/jiaweizzhao/GaLore☆28Updated 9 months ago
- ☆32Updated 5 months ago
- The official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆35Updated this week
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆33Updated 3 months ago
- ☆20Updated last week
- Make reasoning models scalable☆37Updated 3 weeks ago
- ☆13Updated 5 months ago
- ☆17Updated 5 months ago
- Work in progress.☆69Updated 2 weeks ago
- [ICML 2025] Reward-guided Speculative Decoding (RSD) for efficiency and effectiveness.☆32Updated last month
- Code for "Reasoning to Learn from Latent Thoughts"☆105Updated 2 months ago
- ☆18Updated 4 months ago
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆52Updated 3 months ago
- Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More☆31Updated last month
- Exploration of automated dataset selection approaches at large scales.☆45Updated 3 months ago
- [ICML 2025] Predictive Data Selection: The Data That Predicts Is the Data That Teaches☆49Updated 3 months ago
- ☆40Updated this week
- ☆80Updated 5 months ago
- ☆45Updated last week
- Official implementation of the ICML 2024 paper RoSA (Robust Adaptation)☆42Updated last year
- ☆65Updated last year
- ☆50Updated 3 months ago
- A repository for research on medium sized language models.☆76Updated last year
- ☆12Updated 3 months ago
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers☆69Updated this week
- Activation-aware Singular Value Decomposition for Compressing Large Language Models☆71Updated 8 months ago
- ☆58Updated this week