OpenBMB / Eurus

☆320

Alternatives and similar repositories for Eurus

Users who are interested in Eurus are comparing it to the libraries listed below.

GAIR-NLP / ProX
[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale
☆255 Updated 3 weeks ago
QwenLM / AutoIF
☆298 Updated last year
SqueezeAILab / LLM2LLM
[ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
☆186 Updated last year
SuperGPQA / SuperGPQA
☆157 Updated 3 months ago
eddycmu / demystify-long-cot
☆309 Updated 2 months ago
sail-sg / oat-zero
A lightweight reproduction of DeepSeek-R1-Zero with in-depth analysis of self-reflection behavior.
☆245 Updated 3 months ago
THUDM / LongAlign
[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs
☆253 Updated 7 months ago
GAIR-NLP / ReAlign
Reformatted Alignment
☆113 Updated 10 months ago
BrendanGraham14 / mcts-llm
☆129 Updated last year
TIGER-AI-Lab / MAmmoTH
Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024]
☆376 Updated 11 months ago
Re-Align / URIAL
☆311 Updated last year
facebookresearch / sweet_rl
Benchmark and research code for the paper "SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks"
☆233 Updated 3 months ago
OpenBMB / InfiniteBench
Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718
☆343 Updated 10 months ago
GAIR-NLP / LIMR
☆205 Updated 5 months ago
allenai / WildBench
Benchmarking LLMs with Challenging Tasks from Real Users
☆233 Updated 9 months ago
LCLM-Horizon / A-Comprehensive-Survey-For-Long-Context-Language-Modeling
A Comprehensive Survey on Long Context Language Modeling
☆169 Updated 3 weeks ago
anchen1011 / FireAct
FireAct: Toward Language Agent Fine-tuning
☆281 Updated last year
FranxYao / Long-Context-Data-Engineering
Implementation of the paper "Data Engineering for Scaling Language Models to 128K Context"
☆467 Updated last year
modelscope / Trinity-RFT
Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…
☆204 Updated this week
TIGER-AI-Lab / CritiqueFineTuning
Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]
☆169 Updated 3 weeks ago
ADaM-BJTU / O1-CODER
AN O1 REPLICATION FOR CODING
☆336 Updated 7 months ago
microsoft / simulated-trial-and-error
☆122 Updated last year
OpenBMB / UltraFeedback
A large-scale, fine-grained, diverse preference dataset (and models).
☆345 Updated last year
SkyworkAI / skywork-o1-prm-inference
☆65 Updated 8 months ago
jshuadvd / LongRoPE
Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper
☆146 Updated last year
OFA-Sys / gsm8k-ScRel
Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
☆268 Updated 10 months ago
THUDM / ChatGLM-Math
☆83 Updated last year
hkust-nlp / AgentBoard
An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]
☆335 Updated last year
LLM360 / MegaMath
[COLM 2025] An Open Math Pre-training Dataset with 370B Tokens.
☆96 Updated 4 months ago
bigai-nlco / LooGLE
ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models
☆184 Updated 9 months ago