Dominic789654/LongGenBench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Dominic789654/LongGenBench)

Dominic789654 / LongGenBench

Source code for the paper "LongGenBench: Long-context Generation Benchmark"

☆24

Alternatives and similar repositories for LongGenBench

Users that are interested in LongGenBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Linking-ai / SCOPE
View on GitHub
(ACL2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation
☆36May 28, 2025Updated last year
mozhu621 / LongGenBench
View on GitHub
☆37Oct 4, 2025Updated 9 months ago
mutonix / pyramidinfer
View on GitHub
☆47Nov 25, 2024Updated last year
OnlyAR / RAL-Writer
View on GitHub
The code and data for the paper "Lost-in-the-Middle in Long-Text Generation: Synthetic Dataset, Evaluation Framework, and Mitigation"
☆14Oct 8, 2025Updated 9 months ago
L1aoXingyu / llm-infer-bench
View on GitHub
☆12Sep 1, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
HPC4AI / MeAtten
View on GitHub
The repository maintains the source code for the article titled "Optimizing Attention by Exploiting Data Reuse on ARM Multi-core CPUs."
☆17Dec 1, 2024Updated last year
zjunlp / LookAheadTuning
View on GitHub
[WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews
☆17Dec 14, 2025Updated 7 months ago
FFY0 / AdaKV
View on GitHub
The Official Implementation of Ada-KV [NeurIPS 2025]
☆139Nov 26, 2025Updated 7 months ago
zyxxmu / cam
View on GitHub
Pytorch implementation of our paper accepted by ICML 2024 -- CaM: Cache Merging for Memory-efficient LLMs Inference
☆50Jun 19, 2024Updated 2 years ago
avalonstrel / Mitigating-the-Alignment-Tax-of-RLHF
View on GitHub
☆16Feb 8, 2024Updated 2 years ago
pprp / ACBench
View on GitHub
[ICML25] Agentic Compression Benchmark (ACBench)
☆17Jul 2, 2025Updated last year
BAI-LAB / BaiJia
View on GitHub
[WWW 2026] BaiJia: An Open Role-Playing Platform of Chinese Historical Characters
☆28Jan 14, 2026Updated 6 months ago
IsaacRe / vllm-kvcompress
View on GitHub
KV cache compression for high-throughput LLM inference
☆158Feb 5, 2025Updated last year
letsgoLakers / NCIFD
View on GitHub
面向大模型的民族文化数据集
☆13May 26, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mit-han-lab / Quest
View on GitHub
[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference
☆400Jul 10, 2025Updated last year
OpenBMB / InfiniteBench
View on GitHub
Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718
☆387Sep 25, 2024Updated last year
FasterDecoding / SnapKV
View on GitHub
☆324Jul 10, 2025Updated last year
RUCAIBox / EASYEP
View on GitHub
☆28Apr 14, 2025Updated last year
sandeepkumar-skb / pytorch_custom_op
View on GitHub
End to End steps for adding custom ops in PyTorch.
☆24Aug 20, 2020Updated 5 years ago
pkunlp-icler / ChildTuning
View on GitHub
☆33Sep 29, 2021Updated 4 years ago
wwbrannon / congrat
View on GitHub
Code for the paper "ConGraT: Self-Supervised Contrastive Pretraining for Joint Graph and Text Embeddings" -- TextGraphs-17 @ ACL 2024
☆26Nov 4, 2025Updated 8 months ago
Francesco215 / text-diffusion
View on GitHub
Generates text with diffusion models. Reproduction of the Continous Diffusion for Categorical Data paper by Deepmind
☆18Dec 9, 2024Updated last year
sail-sg / SimLayerKV
View on GitHub
The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.
☆54Oct 18, 2024Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
Owen718 / LongPrompt-LLamaGen
View on GitHub
This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…
☆30Oct 21, 2024Updated last year
Zanette-Labs / SpeculativeRejection
View on GitHub
[NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection
☆56Oct 29, 2024Updated last year
C0-Design / MemoryFormer
View on GitHub
An implementation is provided here for the NeurIPS2024 paper "MemoryFormer : Minimize Transformer Computation by Removing Fully-Connected…
☆16Mar 24, 2026Updated 3 months ago
yblir / vllm-learn
View on GitHub
☆17Feb 24, 2026Updated 4 months ago
Azure / MS-AMP-Examples
View on GitHub
Examples for MS-AMP package.
☆30Jul 17, 2025Updated last year
ThisIsHwang / EXIT
View on GitHub
Official code and resources for the paper "EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented Generation."
☆25Jul 15, 2026Updated last week
prometheus-eval / scaling-evaluation-compute
View on GitHub
Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"
☆12Mar 25, 2025Updated last year
IST-DASLab / HALO
View on GitHub
HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arx…
☆31Feb 17, 2025Updated last year
yeonwoo378 / flow-divergence-sampler
View on GitHub
☆16Jun 19, 2026Updated last month
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
KAIST-Visual-AI-Group / Token-Warping-MLLM
View on GitHub
☆22Mar 31, 2026Updated 3 months ago
kietngt00 / UFC
View on GitHub
[NeurIPS 2025] Universal Few-Shot Spatial Control for Diffusion Models
☆21Sep 18, 2025Updated 10 months ago
dgist-datalab / deepsketch-fast2022
View on GitHub
☆24May 6, 2022Updated 4 years ago
SUSTechBruce / LOOK-M
View on GitHub
[EMNLP 2024 Findings🔥] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…
☆103Nov 9, 2024Updated last year
hatsu3 / curator
View on GitHub
☆13Jan 17, 2024Updated 2 years ago
LLaVA-VL / llava-vl.github.io
View on GitHub
☆13Mar 9, 2024Updated 2 years ago
M1n9X / GraphRAG_Lite
View on GitHub
☆16Jul 12, 2024Updated 2 years ago