hamishivi/EasyLM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hamishivi/EasyLM)

hamishivi / EasyLM

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

☆78

Alternatives and similar repositories for EasyLM

Users that are interested in EasyLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zhichaoxu-shufe / context-aware-decoding-qfs
View on GitHub
☆14Jan 10, 2024Updated 2 years ago
allenai / open-instruct
View on GitHub
AllenAI's post-training codebase
☆3,807Updated this week
shadowkiller33 / Contrast-Instruction
View on GitHub
☆19Oct 2, 2023Updated 2 years ago
general-preference / general-preference-model
View on GitHub
[ICML 2025] Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment (https://arxiv.org/abs/2410.02197)
☆43Jun 15, 2026Updated last month
aypan17 / reward-misspecification
View on GitHub
☆10Mar 13, 2023Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
jwhj / OREO
View on GitHub
☆116Jan 21, 2025Updated last year
unicamp-dl / ExaRanker
View on GitHub
☆29Feb 2, 2024Updated 2 years ago
tengxiao1 / MR-Search
View on GitHub
Meta-Reinforcement Learning with Self-Reflection
☆33Mar 26, 2026Updated 3 months ago
TIGER-AI-Lab / MAmmoTH2
View on GitHub
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
☆146Oct 27, 2024Updated last year
ApolloResearch / apd
View on GitHub
Attribution-based Parameter Decomposition
☆35Jun 11, 2025Updated last year
haozheji / exact-optimization
View on GitHub
ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment
☆55Jun 16, 2024Updated 2 years ago
TsinghuaC3I / FS-GEN
View on GitHub
Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.
☆13Nov 19, 2024Updated last year
fe1ixxu / CPO_SIMPO
View on GitHub
This repository contains the joint use of CPO and SimPO method for better reference-free preference learning methods.
☆59Aug 13, 2024Updated last year
hamishivi / automated-instruction-selection
View on GitHub
Exploration of automated dataset selection approaches at large scales.
☆55Mar 4, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
reissbaker / clevergpt
View on GitHub
Training GPTs to solve interaction nets
☆18Aug 14, 2024Updated last year
SALT-NLP / demonstrated-feedback
View on GitHub
☆131Oct 1, 2024Updated last year
OpenNLPLab / LASP
View on GitHub
Linear Attention Sequence Parallelism (LASP)
☆87Jun 4, 2024Updated 2 years ago
allenai / reward-bench
View on GitHub
RewardBench: the first evaluation tool for reward models.
☆727Feb 16, 2026Updated 5 months ago
allenai / numglue
View on GitHub
NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks
☆20May 10, 2022Updated 4 years ago
frankxwang / dpo-prefix-sharing
View on GitHub
DPO, but faster 🚀
☆52Dec 6, 2024Updated last year
EleutherAI / bergson
View on GitHub
Mapping out the "memory" of neural nets with data attribution
☆70Updated this week
hengyuan-hu / jax-vs-pytorch
View on GitHub
☆13Feb 25, 2025Updated last year
Infini-AI-Lab / M2PO
View on GitHub
☆34Oct 8, 2025Updated 9 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
noharm-ai / brateca
View on GitHub
Brazilian Tertiary Care Dataset
☆18Dec 14, 2022Updated 3 years ago
esteng / regal_program_learning
View on GitHub
☆27Sep 11, 2024Updated last year
modal-labs / stopwatch
View on GitHub
A tool for benchmarking LLMs on Modal
☆56Aug 29, 2025Updated 10 months ago
UKPLab / on-emergence
View on GitHub
Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning
☆33Jan 9, 2025Updated last year
HKUNLP / STRING
View on GitHub
[ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"
☆82Nov 25, 2024Updated last year
McGill-NLP / VinePPO
View on GitHub
Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"
☆192May 25, 2025Updated last year
AbanteAI / LoCoDiff-bench
View on GitHub
☆33Oct 15, 2025Updated 9 months ago
ZhaolinGao / A-PO
View on GitHub
Accelerating RL for LLM Reasoning with Optimal Advantage Regression
☆41May 30, 2025Updated last year
juzhengz / logit-fusion
View on GitHub
Learning from Mixed Rollouts: Logit Fusion as a Bridge Between Imitation and Exploration
☆17Feb 24, 2026Updated 5 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
princeton-nlp / USACO
View on GitHub
Can Language Models Solve Olympiad Programming?
☆124Jan 14, 2025Updated last year
XueruiSu / Trust-Region-Preference-Approximation
View on GitHub
Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning
☆15Jun 28, 2025Updated last year
TsinghuaC3I / Intuitive-Fine-Tuning
View on GitHub
[ACL 2025, Main Conference, Oral] Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process
☆30Aug 2, 2024Updated last year
chujiezheng / LLM-Extrapolation
View on GitHub
Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"
☆75May 20, 2025Updated last year
milesaturpin / cot-unfaithfulness
View on GitHub
☆57Oct 23, 2023Updated 2 years ago
IBM / SALMON
View on GitHub
Self-Alignment with Principle-Following Reward Models
☆170Sep 18, 2025Updated 10 months ago
sheryc / resonance_rope
View on GitHub
[ACL 24 Findings] Implementation of Resonance RoPE and the PosGen synthetic dataset.
☆24Mar 5, 2024Updated 2 years ago