zexuanqiu/CLongEval

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zexuanqiu/CLongEval)

zexuanqiu / CLongEval

CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models

☆49

Alternatives and similar repositories for CLongEval

Users that are interested in CLongEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

RUCAIBox / BAMBOO
View on GitHub
☆36Mar 25, 2024Updated 2 years ago
nick7nlp / Counting-Stars
View on GitHub
Counting-Stars (★)
☆83Nov 24, 2025Updated 8 months ago
ThisIsHwang / EXIT
View on GitHub
Official code and resources for the paper "EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented Generation."
☆25Jul 15, 2026Updated 2 weeks ago
open-compass / Ada-LEval
View on GitHub
The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"
☆56May 22, 2025Updated last year
bigai-nlco / CREAM
View on GitHub
[NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding
☆22Oct 10, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
google-deepmind / loft
View on GitHub
LOFT: A 1 Million+ Token Long-Context Benchmark
☆237Apr 13, 2026Updated 3 months ago
THUDM / LongBench
View on GitHub
LongBench v2 and LongBench (ACL 25'&24')
☆1,215Jan 15, 2025Updated last year
OpenLMLab / LEval
View on GitHub
[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark
☆406Jul 9, 2024Updated 2 years ago
sunjie279 / SimCT-
View on GitHub
☆21May 22, 2026Updated 2 months ago
NormXU / Consistent-DynamicNTKRoPE
View on GitHub
An Experiment on Dynamic NTK Scaling RoPE
☆65Nov 26, 2023Updated 2 years ago
nju-websoft / KnowLA
View on GitHub
KnowLA: Enhancing Parameter-efficient Finetuning with Knowledgeable Adaptation, NAACL 2024
☆16Jul 29, 2024Updated 2 years ago
jshuadvd / LongRoPE
View on GitHub
Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper
☆154Jul 20, 2024Updated 2 years ago
THUDM / LongAlign
View on GitHub
[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs
☆262Dec 16, 2024Updated last year
TIGER-AI-Lab / LongICLBench
View on GitHub
Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025]
☆113Feb 20, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
infinigence / LVEval
View on GitHub
Repository of LV-Eval Benchmark
☆78Aug 31, 2024Updated last year
Lyun0912-wu / LongAttn
View on GitHub
LongAttn ：Selecting Long-context Training Data via Token-level Attention
☆15Jul 16, 2025Updated last year
OpenBMB / InfiniteBench
View on GitHub
Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718
☆387Sep 25, 2024Updated last year
LCM-Lab / LOOM-Eval
View on GitHub
A comprehensive and efficient long-context model evaluation framework
☆31Feb 25, 2026Updated 5 months ago
Glaciohound / LM-Infinite
View on GitHub
Implementation of NAACL 2024 Outstanding Paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆153Mar 13, 2025Updated last year
MayDomine / Burst-Attention
View on GitHub
Distributed IO-aware Attention algorithm
☆24Sep 24, 2025Updated 10 months ago
cnlinxi / LLM-paper-daily
View on GitHub
Automatically Update LLM Papers Daily using Github Actions. Ref: https://github.com/Vincentqyw/cv-arxiv-daily
☆10Updated this week
MozerWang / Loong
View on GitHub
[EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA
☆155Dec 22, 2025Updated 7 months ago
princeton-pli / PruLong
View on GitHub
Code for the preprint "Cache Me If You Can: How Many KVs Do You Need for Effective Long-Context LMs?"
☆48Jul 29, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
purbeshmitra / MOTIF
View on GitHub
MOTIF: Modular Thinking via Reinforcement Fine-tuning in LLMs
☆17Jul 6, 2025Updated last year
feiyang-k / AutoScale
View on GitHub
Official Code Repository for [AutoScale📈: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*…
☆14Aug 8, 2025Updated 11 months ago
Hambaobao / Marathon
View on GitHub
Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.
☆10May 16, 2024Updated 2 years ago
bigai-nlco / LooGLE
View on GitHub
ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models
☆199Oct 8, 2024Updated last year
zhaochenyang20 / Prompt2Model-Self-Guide
View on GitHub
SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper
☆34May 29, 2024Updated 2 years ago
chtmp223 / suri
View on GitHub
Suri: Multi-constraint instruction following for long-form text generation [EMNLP’24]
☆27Oct 3, 2025Updated 9 months ago
seanzhang-zhichen / Qwen-WisdomVast
View on GitHub
Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …
☆17Apr 12, 2024Updated 2 years ago
yifeiwang77 / Self-Correction
View on GitHub
☆20Nov 3, 2024Updated last year
seq-to-mind / planning_dial_summ
View on GitHub
One implementation of the paper "Controllable Neural Dialogue Summarization with Personal Named Entity Planning" (EMNLP 2022).
☆18Nov 9, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
jiacheng-ye / kg_gater
View on GitHub
[EMNLP 2021] Code for our EMNLP 2021 paper “Heterogeneous Graph Neural Networks for Keyphrase Generation”
☆14Nov 13, 2021Updated 4 years ago
tongzeliang / EvoPrompt
View on GitHub
☆13Feb 17, 2025Updated last year
Wangpeiyi9979 / HCL-Text2AMR
View on GitHub
Code for ACL22 short Paper "Hierarchical Curriculum Learning for AMR Parsing"
☆13Jun 1, 2022Updated 4 years ago
GATECH-EIC / LaCache
View on GitHub
[ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models
☆16Nov 4, 2025Updated 8 months ago
dwzhu-pku / PoSE
View on GitHub
Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)
☆208May 20, 2024Updated 2 years ago
MySong7NLPer / AI-Conference-Acceptance-Rate
View on GitHub
☆11Aug 8, 2022Updated 3 years ago
leezythu / FocusLLM
View on GitHub
FocusLLM: Scaling LLM’s Context by Parallel Decoding
☆45Dec 8, 2024Updated last year