Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
☆78Aug 17, 2024Updated last year
Alternatives and similar repositories for EasyLM
Users that are interested in EasyLM are comparing it to the libraries listed below
Sorting:
- ☆11Mar 13, 2023Updated 2 years ago
- ☆17Aug 1, 2025Updated 7 months ago
- Plan✕ is a platform for creating and publishing digital planning services☆17Updated this week
- Training GPTs to solve interaction nets☆18Aug 14, 2024Updated last year
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"☆91Jul 2, 2024Updated last year
- ☆19Oct 2, 2023Updated 2 years ago
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment☆56Jun 16, 2024Updated last year
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"☆78Nov 25, 2024Updated last year
- ☆116Jan 21, 2025Updated last year
- Can Language Models Solve Olympiad Programming?☆123Jan 14, 2025Updated last year
- AllenAI's post-training codebase☆3,605Updated this week
- Linear Attention Sequence Parallelism (LASP)☆88Jun 4, 2024Updated last year
- DPO, but faster 🚀☆48Dec 6, 2024Updated last year
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆52Jul 15, 2025Updated 7 months ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆186May 25, 2025Updated 9 months ago
- ☆130Oct 1, 2024Updated last year
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Sep 21, 2025Updated 5 months ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆14Jun 28, 2025Updated 8 months ago
- ☆15Jan 12, 2026Updated last month
- ☆16Jul 29, 2025Updated 7 months ago
- Exploration of automated dataset selection approaches at large scales.☆52Mar 4, 2025Updated last year
- Code for the paper "FinRLlama: A Solution to LLM-Engineered Signals Challenge at FinRL Contest 2024"☆13Feb 14, 2025Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆149Oct 27, 2024Updated last year
- Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.☆13Nov 19, 2024Updated last year
- [ICLR 2026] BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs☆17May 21, 2025Updated 9 months ago
- ☆13Jul 22, 2024Updated last year
- [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi…☆14Jun 6, 2025Updated 8 months ago
- [ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization☆12Jan 26, 2025Updated last year
- ☆29Oct 8, 2025Updated 4 months ago
- CMU Linguistic Annotation Backend☆15Sep 22, 2025Updated 5 months ago
- [ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations☆18Oct 18, 2025Updated 4 months ago
- ☆15Jun 2, 2025Updated 9 months ago
- Apps that run on modal.com☆13Sep 14, 2025Updated 5 months ago
- Mixture-of-Basis-Experts for Compressing MoE-based LLMs☆29Dec 24, 2025Updated 2 months ago
- Demo showing neon and Nervana Cloud integration with OpenAI's RL-Gym☆23Jan 3, 2023Updated 3 years ago
- RewardBench: the first evaluation tool for reward models.☆697Feb 16, 2026Updated 2 weeks ago
- Plancraft is a minecraft environment and agent suite to test planning capabilities in LLMs☆26Nov 7, 2025Updated 3 months ago
- Python implementation of tabular asynchronous actor critic☆11May 3, 2016Updated 9 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆12Jun 15, 2023Updated 2 years ago