wangyu-ustc/MemoryLLM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wangyu-ustc/MemoryLLM)

wangyu-ustc / MemoryLLM

The official implementation of the ICML 2024 paper "MemoryLLM: Towards Self-Updatable Large Language Models" and "M+: Extending MemoryLLM with Scalable Long-Term Memory"

☆317

Alternatives and similar repositories for MemoryLLM

Users that are interested in MemoryLLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wangyu-ustc / LVChat
View on GitHub
The official implementation of the paper **LVChat: Facilitating Long Video Comprehension**
☆14Apr 15, 2024Updated 2 years ago
wangyu-ustc / LargeScaleWashing
View on GitHub
The official implementation of the paper "Large Scale Knowledge Washing"
☆10Jun 12, 2024Updated 2 years ago
XinshuangL / SELF-PARAM
View on GitHub
The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"
☆15May 18, 2025Updated last year
wangyu-ustc / LM4CV
View on GitHub
The official implementation of the paper **Learning Concise and Descriptive Attributes for Visual Recognition**
☆49Dec 14, 2023Updated 2 years ago
bingreeky / MemGen
View on GitHub
MemGen: Weaving Generative Latent Memory for Self-Evolving Agents
☆406Jun 10, 2026Updated last month
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
convergence-ai / lm2
View on GitHub
Official repo of paper LM2
☆48Feb 13, 2025Updated last year
zhangyulin-space / ChatFerry
View on GitHub
☆104Oct 8, 2025Updated 9 months ago
facebookresearch / memory
View on GitHub
Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…
☆379Dec 12, 2024Updated last year
BytedTsinghua-SIA / MemAgent
View on GitHub
A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.
☆1,085May 12, 2026Updated 2 months ago
wangyu-ustc / Mem-alpha
View on GitHub
The official implementation of the paper "Mem-α: Learning Memory Construction via Reinforcement Learning"
☆218Dec 25, 2025Updated 6 months ago
princeton-nlp / ProLong
View on GitHub
Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"
☆260Sep 12, 2025Updated 10 months ago
MIT-MI / MEM1
View on GitHub
☆325Jan 3, 2026Updated 6 months ago
princeton-nlp / CEPE
View on GitHub
[ACL 2024] Long-Context Language Modeling with Parallel Encodings
☆169Jun 13, 2024Updated 2 years ago
FasterDecoding / SnapKV
View on GitHub
☆324Jul 10, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
sinberCS / switch2ai
View on GitHub
switch2ai - A JetBrains IDE plugin enabling seamless collaboration between JetBrains IDEs and various AI agents (Cursor, Qoder, Claude co…
☆173Nov 11, 2025Updated 8 months ago
aoda-zhang / PawHaven-FullStack-React-NodeJS
View on GitHub
🐱 PawHaven — an open-source platform that helps volunteers, shelters, and adopters report, track, and share stray animal rescue cases (f…
☆90Updated this week
YiCheng98 / IntegrativeDecoding
View on GitHub
Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"
☆33Apr 12, 2025Updated last year
Glaciohound / LM-Infinite
View on GitHub
Implementation of NAACL 2024 Outstanding Paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆152Mar 13, 2025Updated last year
FranxYao / Retrieval-Head-with-Flash-Attention
View on GitHub
Efficient retrieval head analysis with triton flash attention that supports topK probability
☆13Jun 15, 2024Updated 2 years ago
jiaweizzhao / InRank
View on GitHub
☆153Jan 2, 2024Updated 2 years ago
allenai / sso
View on GitHub
Repository for Skill Set Optimization
☆14Jul 26, 2024Updated last year
liufanfanlff / C3-Context-Cascade-Compression
View on GitHub
Official code implementation of Context Cascade Compression: Exploring the Upper Limits of Text Compression
☆313Jan 27, 2026Updated 5 months ago
OpenBMB / InfiniteBench
View on GitHub
Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718
☆387Sep 25, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
jeon185 / LaViC
View on GitHub
Implementation of LaViC (KDD 2025)
☆13Jun 1, 2025Updated last year
menik1126 / UNComp
View on GitHub
[EMNLP 2025🔥] UNComp: Can Matrix Entropy Uncover Sparsity? -- A Compressor Design from an Uncertainty-Aware Perspective
☆20Jan 7, 2026Updated 6 months ago
princeton-pli / PruLong
View on GitHub
Code for the preprint "Cache Me If You Can: How Many KVs Do You Need for Effective Long-Context LMs?"
☆48Jul 29, 2025Updated 11 months ago
caixd-220529 / LifelongAgentBench
View on GitHub
Code repo for "LifelongAgentBench: Evaluating LLM Agents as Lifelong Learners"
☆93May 30, 2025Updated last year
HUST-AI-HYZ / MemoryAgentBench
View on GitHub
Open source code for ICLR 2026 Paper: Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions
☆404May 21, 2026Updated 2 months ago
Jinxhy / THEMIS
View on GitHub
[USENIX Security'25] THEMIS: Towards Practical Intellectual Property Protection for Post-Deployment On-Device Deep Learning Models
☆108Aug 13, 2025Updated 11 months ago
Josh00-Lu / Habi
View on GitHub
[ICML 2025 Poster] Official PyTorch Implementation of "Habitizing Diffusion Planning for Efficient and Effective Decision Making"
☆36May 26, 2025Updated last year
Tanglumy / Finance-Bro
View on GitHub
your finance bro Agent for trading and investing
☆111Nov 8, 2025Updated 8 months ago
gwh22 / UniVoice
View on GitHub
UniVoice: Unifying Autoregressive ASR and Flow-Matching based TTS with Large Language Models
☆115Oct 30, 2025Updated 8 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
thunlp / InfLLM
View on GitHub
The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Mem…
☆405Apr 20, 2024Updated 2 years ago
booydar / babilong
View on GitHub
BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.
☆250Jun 1, 2026Updated last month
MarkLee131 / PoC-Research-Papers
View on GitHub
Research papers on Proot-of-Concepts
☆114Feb 3, 2026Updated 5 months ago
PKU-ML / LongPPL
View on GitHub
Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"
☆115Oct 11, 2025Updated 9 months ago
Infini-AI-Lab / STEM
View on GitHub
☆66May 7, 2026Updated 2 months ago
TemporaryLoRA / Temp-LoRA
View on GitHub
☆130Mar 31, 2024Updated 2 years ago
bingreeky / GMemory
View on GitHub
☆259Apr 9, 2026Updated 3 months ago