Zoeyyao27 / SirLLMLinks

This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM

☆59

Alternatives and similar repositories for SirLLM

Users that are interested in SirLLM are comparing it to the libraries listed below

Sorting:

lunyiliu / CoachLM
Code and data for CoachLM, an automatic instruction revision approach LLM instruction tuning.
☆60Updated last year
hetailang / SqueezeAttention
☆38Updated 9 months ago
YutongWang1216 / DocMTAgent
Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
☆45Updated 5 months ago
18907305772 / FuseAI
FuseAI Project
☆87Updated 6 months ago
Geaming2002 / Ruler
Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models
☆38Updated 10 months ago
orionw / promptriever
The first dense retrieval model that can be prompted like an LM
☆81Updated 2 months ago
clinicalml / co-llm
Co-LLM: Learning to Decode Collaboratively with Multiple Language Models
☆116Updated last year
sanyalsunny111 / LLM-Inheritune
This is the official repository for Inheritune.
☆112Updated 5 months ago
jeffreysijuntan / lloco
The official repo for "LLoCo: Learning Long Contexts Offline"
☆118Updated last year
voidism / Lookback-Lens
Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"
☆130Updated 11 months ago
dinobby / MAgICoRE
☆24Updated 10 months ago
THU-KEG / Agentic-Reward-Modeling
[ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
☆99Updated last month
Quehry / HelloBench
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
☆46Updated 8 months ago
sunnynexus / RetroLLM
RetroLLM: Empowering LLMs to Retrieve Fine-grained Evidence within Generation [ACL 2025]
☆115Updated 6 months ago
ZihanWang314 / coeCheck
☆19Updated 5 months ago
open-compass / Ada-LEval
The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"
☆54Updated 2 months ago
samchaineau / llm_slerp_generation
Repo hosting codes and materials related to speeding LLMs' inference using token merging.
☆36Updated 2 weeks ago
rohinmanvi / Capability-Aware-and-Mid-Generation-Self-Evaluations
☆21Updated last week
VITA-Group / WeLore
From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,…
☆47Updated 3 months ago
StigLidu / DualDistill
The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"
☆86Updated 2 weeks ago
LLM360 / crystalcoder-data-prep
Data preparation code for CrystalCoder 7B LLM
☆45Updated last year
uclaml / COPS
The official implementation of Cross-Task Experience Sharing (COPS)
☆24Updated 9 months ago
efficientscaling / Z1
Repo for "Z1: Efficient Test-time Scaling with Code"
☆63Updated 3 months ago
QwenLM / Self-Lengthen
☆87Updated 9 months ago
SalesforceAIResearch / GemFilter
☆83Updated 6 months ago
rosewang2008 / backtracing
Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.
☆89Updated last year
royeisen / reasoning_loading_bar
☆47Updated 3 weeks ago
giangdip2410 / HyperRouter
Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"
☆33Updated last year
DRSY / EasyKV
Easy control for Key-Value Constrained Generative LLM Inference(https://arxiv.org/abs/2402.06262)
☆63Updated last year
wuhy68 / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks (EMNLP'24)
☆146Updated 10 months ago