lmgame-org / GRLLinks
Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning
☆28Updated this week
Alternatives and similar repositories for GRL
Users that are interested in GRL are comparing it to the libraries listed below
Sorting:
- Experimental scripts for researching data adaptive learning rate scheduling.☆22Updated last year
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆56Updated last year
- DPO, but faster 🚀☆44Updated 9 months ago
- Explorations into adversarial losses on top of autoregressive loss for language modeling☆37Updated 7 months ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Updated last year
- ☆32Updated last year
- 😊 TPTT: Transforming Pretrained Transformers into Titans☆27Updated this week
- Code and data for paper "(How) do Language Models Track State?"☆18Updated 5 months ago
- Resa: Transparent Reasoning Models via SAEs☆41Updated last month
- Implementation of 2-simplicial attention proposed by Clift et al. (2019) and the recent attempt to make practical in Fast and Simplex, Ro…☆46Updated 2 weeks ago
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…☆52Updated 6 months ago
- Flash Attention Triton kernel with support for second-order derivatives☆86Updated this week
- ☆39Updated last year
- Linear Attention Sequence Parallelism (LASP)☆86Updated last year
- Fork of Flame repo for training of some new stuff in development☆17Updated 2 weeks ago
- Triton Implementation of HyperAttention Algorithm☆48Updated last year
- ☆22Updated this week
- Official code for the paper "Attention as a Hypernetwork"☆42Updated last year
- Lottery Ticket Adaptation☆39Updated 10 months ago
- FlexAttention w/ FlashAttention3 Support☆27Updated 11 months ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆16Updated 5 months ago
- Here we will test various linear attention designs.☆62Updated last year
- Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene…☆33Updated 3 years ago
- imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…☆36Updated last year
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference…☆28Updated last year
- [ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning☆130Updated last week
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆25Updated 10 months ago
- This is a simple torch implementation of the high performance Multi-Query Attention☆16Updated 2 years ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated last year
- 32 times longer context window than vanilla Transformers and up to 4 times longer than memory efficient Transformers.☆49Updated 2 years ago