Bui1dMySea/MemLong

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Bui1dMySea/MemLong)

Bui1dMySea / MemLong

☆96

Alternatives and similar repositories for MemLong

Users that are interested in MemLong are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LCM-Lab / LOGO
View on GitHub
Code for paper: Long cOntext aliGnment via efficient preference Optimization
☆24Oct 10, 2025Updated 7 months ago
Zoeyyao27 / SirLLM
View on GitHub
This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM
☆60May 28, 2024Updated last year
Jikai0Wang / OPT-Tree
View on GitHub
☆30May 24, 2025Updated 11 months ago
snu-mllab / Context-Memory
View on GitHub
Pytorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24)
☆63Apr 18, 2024Updated 2 years ago
Aloriosa / srmt
View on GitHub
The original Shared Recurrent Memory Transformer implementation
☆36Jul 11, 2025Updated 10 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
OpenNLG / OpenBA-v2
View on GitHub
OpenBA-V2: 3B LLM (Large Language Model) with T5 architecture, utilizing model pruning technique and continuing pretraining from OpenBA-1…
☆25May 10, 2024Updated 2 years ago
dwzq-com-cn / DongwuLLM
View on GitHub
This is the codebase for pre-training, compressing, extending, and distilling LLMs with Megatron-LM.
☆12Mar 11, 2024Updated 2 years ago
ZetangForward / CMD-Context-aware-Model-self-Detoxification
View on GitHub
CMD: a framework for Context-aware Model self-Detoxification (EMNLP2024 Long Paper)
☆17Feb 10, 2025Updated last year
rayleizhu / vllm-ra
View on GitHub
[ACL 2024] RelayAttention for Efficient Large Language Model Serving with Long System Prompts
☆39Feb 29, 2024Updated 2 years ago
thu-ml / ReMoE
View on GitHub
[ICLR2025] Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM.
☆114Dec 20, 2024Updated last year
Marker-Inc-Korea / AutoRAG_ARAGOG_Paper
View on GitHub
☆22Jul 18, 2024Updated last year
pzs19 / LEMMA
View on GitHub
☆16Sep 4, 2025Updated 8 months ago
Leey21 / CipherBank
View on GitHub
☆12Jun 13, 2025Updated 11 months ago
technion-cs-nlp / hallucination-mitigation
View on GitHub
☆23Dec 17, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
LCM-Lab / Bridge_Gap_Diffusion
View on GitHub
Diffusion Model Improvement Method
☆35Sep 4, 2023Updated 2 years ago
EvanZhuang / AgenticLU
View on GitHub
Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).
☆13Sep 22, 2025Updated 8 months ago
LCM-Lab / LOOM-Eval
View on GitHub
A comprehensive and efficient long-context model evaluation framework
☆31Feb 25, 2026Updated 2 months ago
AlexCuadron / ThinkingAgent
View on GitHub
Systematic evaluation framework that automatically rates overthinking behavior in large language models.
☆101May 16, 2025Updated last year
spcl / MRAG
View on GitHub
Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"
☆242Feb 26, 2026Updated 2 months ago
jeffreysijuntan / lloco
View on GitHub
The official repo for "LLoCo: Learning Long Contexts Offline"
☆117Jun 15, 2024Updated last year
yale-nlp / refdpo
View on GitHub
☆16Jul 23, 2024Updated last year
DRSY / KV_Compression
View on GitHub
[EMNLP 2023]Context Compression for Auto-regressive Transformers with Sentinel Tokens
☆25Nov 6, 2023Updated 2 years ago
microsoft / FILM
View on GitHub
Official repo for "Make Your LLM Fully Utilize the Context"
☆272May 15, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
shoaibahmed / llm_depth_pruning
View on GitHub
Official implementation of the paper: "A deeper look at depth pruning of LLMs"
☆15Jul 24, 2024Updated last year
SWE-Gym / SWE-Bench-Fork
View on GitHub
☆13Mar 5, 2025Updated last year
whyNLP / LCKV
View on GitHub
Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance…
☆156Apr 7, 2025Updated last year
qhjqhj00 / MemoRAG
View on GitHub
Empowering RAG with a memory-based data interface for all-purpose applications!
☆2,240Sep 11, 2025Updated 8 months ago
DerrickYLJ / TidalDecode
View on GitHub
[ICLR 2025] TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention
☆53Aug 6, 2025Updated 9 months ago
arnab-api / romba
View on GitHub
Applies ROME and MEMIT on Mamba-S4 models
☆15Apr 5, 2024Updated 2 years ago
WindyLee0822 / CTG
View on GitHub
Source code of “Reinforcement Learning with Token-level Feedback for Controllable Text Generation (NAACL 2024)
☆17Dec 8, 2024Updated last year
thu-coai / SPaR
View on GitHub
☆47Jun 11, 2025Updated 11 months ago
caskcsg / longcontext
View on GitHub
Long Context Research
☆32Jan 26, 2026Updated 3 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
fate-ubw / RAGLAB
View on GitHub
[EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation
☆309Oct 18, 2024Updated last year
DeepAuto-AI / sglang
View on GitHub
This is a fork of SGLang for hip-attention integration. Please refer to hip-attention for detail.
☆18Mar 31, 2026Updated last month
TIGER-AI-Lab / LongRAG
View on GitHub
Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".
☆248Aug 25, 2024Updated last year
193746 / VHASR
View on GitHub
☆11Oct 31, 2024Updated last year
ChenyuHeidiZhang / VL-commonsense
View on GitHub
☆14May 23, 2022Updated 3 years ago
MozerWang / Loong
View on GitHub
[EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA
☆152Dec 22, 2025Updated 5 months ago
wdlctc / headinfer
View on GitHub
☆63May 16, 2025Updated last year