LCM-Lab / LOOM-EvalLinks
A comprehensive and efficient long-context model evaluation framework
☆27Updated last week
Alternatives and similar repositories for LOOM-Eval
Users that are interested in LOOM-Eval are comparing it to the libraries listed below
Sorting:
- Mixture-of-Basis-Experts for Compressing MoE-based LLMs☆21Updated 2 months ago
- [ICML'25] Official code of paper "Fast Large Language Model Collaborative Decoding via Speculation"☆28Updated 4 months ago
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆24Updated last year
- ☆45Updated last month
- ☆19Updated 10 months ago
- ☆14Updated last year
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆43Updated 8 months ago
- ☆17Updated 8 months ago
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆23Updated last month
- ☆30Updated 2 months ago
- Code and Model for NeurIPS 2024 Spotlight Paper "Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training…☆43Updated last year
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization☆79Updated last month
- ☆85Updated last week
- FocusLLM: Scaling LLM’s Context by Parallel Decoding☆43Updated 11 months ago
- (ACL 2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation☆33Updated 5 months ago
- ☆19Updated 11 months ago
- ☆20Updated last week
- FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning☆53Updated last month
- The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆15Updated 2 months ago
- [ACL 2025] An inference-time decoding strategy with adaptive foresight sampling☆106Updated 6 months ago
- [NeurIPS 2025] Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"☆24Updated last month
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆40Updated last month
- ☆61Updated 4 months ago
- The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink…☆105Updated 2 months ago
- ☆103Updated 2 months ago
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks☆33Updated 2 months ago
- ☆14Updated last year
- SIFT: Grounding LLM Reasoning in Contexts via Stickers☆58Updated 8 months ago
- ☆45Updated last month
- ☆29Updated 5 months ago