DSL-Lab / aopsLinks
☆27Updated 6 months ago
Alternatives and similar repositories for aops
Users that are interested in aops are comparing it to the libraries listed below
Sorting:
- ☆90Updated last year
- ☆84Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆78Updated last year
- Language modeling via stochastic processes. Oral @ ICLR 2022.☆138Updated 2 years ago
- DiffusER: Discrete Diffusion via Edit-based Reconstruction (Reid, Hellendoorn & Neubig, 2022)☆54Updated last week
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".☆33Updated 2 months ago
- A framework for few-shot evaluation of autoregressive language models.☆24Updated last year
- ☆31Updated 2 years ago
- ☆109Updated 3 years ago
- Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023☆136Updated last year
- ☆22Updated 2 years ago
- [ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"☆69Updated 6 months ago
- ☆66Updated 11 months ago
- ☆83Updated last year
- Staged Training for Transformer Language Models☆32Updated 3 years ago
- ☆13Updated last week
- ☆34Updated last year
- ☆51Updated 2 years ago
- [NeurIPS 2023] Learning Transformer Programs☆162Updated last year
- ☆34Updated last year
- Efficient Transformers with Dynamic Token Pooling☆62Updated 2 years ago
- Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implici…☆108Updated last year
- ☆11Updated last year
- [NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…☆66Updated last year
- Implementation of ICML 22 Paper: Scaling Structured Inference with Randomization☆14Updated 3 years ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆77Updated 2 years ago
- ☆105Updated 2 years ago
- ☆22Updated 2 months ago
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆46Updated last year
- Code for the paper "Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving"☆19Updated 2 years ago