import-myself/Membench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/import-myself/Membench)

import-myself / Membench

Membenchmark repository

☆55

Alternatives and similar repositories for Membench

Users that are interested in Membench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nuster1128 / MemEngine
View on GitHub
A Comprehensive Library for Memory of LLM-based Agents.
☆113May 13, 2025Updated last year
nuster1128 / MemSim
View on GitHub
The official repository for "MemSim: A Bayesian Simulator for Evaluating Memory of LLM-based Personal Assistants".
☆17Oct 10, 2024Updated last year
THUIR / MemoryBench
View on GitHub
Code for MemoryBench: A Benchmark for Memory and Continual Learning in LLM Systems
☆84Jun 27, 2026Updated 3 weeks ago
rui9812 / CAM
View on GitHub
[NeurIPS 2025] CAM: A Constructivist View of Agentic Memory for LLM-Based Reading Comprehension
☆23Oct 8, 2025Updated 9 months ago
HUST-AI-HYZ / MemoryAgentBench
View on GitHub
Open source code for ICLR 2026 Paper: Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions
☆407May 21, 2026Updated 2 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
microsoft / SeCom
View on GitHub
SECOM: On Memory Construction and Retrieval for Personalized Conversational Agents, ICLR 2025
☆60Mar 1, 2025Updated last year
jiho283 / DialSim
View on GitHub
Official repository of DialSim
☆33Oct 31, 2025Updated 8 months ago
zjunlp / MemBase
View on GitHub
A Comprehensive Benchmarking Framework for Long-Term Conversational Memory Layers
☆42Jun 29, 2026Updated 3 weeks ago
mohammadtavakoli78 / BEAM
View on GitHub
[ICLR 2026] Beyond a Million Tokens: Benchmarking and Enhancing Long-Term Memory in LLMs
☆110Feb 2, 2026Updated 5 months ago
Chocay / QtDracula
View on GitHub
基于 PyDracula 移植的Qt 客户端 UI 框架
☆11May 10, 2022Updated 4 years ago
ZexueHe / MemoryArena
View on GitHub
☆43Jun 1, 2026Updated last month
Elvin-Yiming-Du / Memory-T1
View on GitHub
This respository is used for time reasoning task for mult-session dialogue system.
☆16Feb 7, 2026Updated 5 months ago
kixlab / CUPID
View on GitHub
[COLM 2025] CUPID: Evaluating Personalized and Contextualized Alignment of LLMs from Interactions
☆17Dec 16, 2025Updated 7 months ago
yanweiyue / Mem-T
View on GitHub
Mem-T: Densifying Rewards for Long-Horizon Memory Agents
☆39Mar 22, 2026Updated 4 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zhongwanjun / MemoryBank-SiliconFriend
View on GitHub
Source code and demo for memory bank and SiliconFriend
☆441May 24, 2023Updated 3 years ago
wangyu-ustc / Mem-alpha
View on GitHub
The official implementation of the paper "Mem-α: Learning Memory Construction via Reinforcement Learning"
☆218Dec 25, 2025Updated 6 months ago
zjunlp / LightMem
View on GitHub
[ICLR 2026] LightMem: Lightweight and Efficient Memory-Augmented Generation
☆1,030Jul 16, 2026Updated last week
MIT-MI / MEM1
View on GitHub
☆325Jan 3, 2026Updated 6 months ago
AvatarMemory / RealMemBench
View on GitHub
☆47Apr 7, 2026Updated 3 months ago
mlsys-io / Halo_demo
View on GitHub
A novel system that unifies LLM serving with query optimization to efficiently process batch agentic workflows.
☆15Jun 14, 2026Updated last month
WujiangXu / A-mem
View on GitHub
The code for NeurIPS 2025 paper "A-Mem: Agentic Memory for LLM Agents"
☆924Mar 5, 2026Updated 4 months ago
uservan / speculative_thinking
View on GitHub
☆34Oct 13, 2025Updated 9 months ago
ThisIsCosine / AlpsBench
View on GitHub
☆25Apr 3, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ShootingWong / RichRAG
View on GitHub
☆11Nov 23, 2024Updated last year
xjtuleeyf / Locomo-Plus
View on GitHub
☆28Feb 13, 2026Updated 5 months ago
AvatarMemory / CloneMemBench
View on GitHub
Benchmarking Long-Term Memory for AI Clones
☆29Apr 7, 2026Updated 3 months ago
zhzihao / WikiGenBench
View on GitHub
WIKIGENBENCH: Exploring Full-length Wikipedia Generation under Real-World Scenario (COLING 2025)
☆13Jan 5, 2025Updated last year
AI45Lab / ReflectionBench
View on GitHub
[ICML 2025] ReflectionBench: Evaluating Epistemic Agency in Large Language Models
☆21Jun 24, 2025Updated last year
justincui03 / or-bench
View on GitHub
[ICML 2025] Official repository for paper "OR-Bench: An Over-Refusal Benchmark for Large Language Models"
☆28Mar 4, 2025Updated last year
YingqiLiu1999 / DFedPGP
View on GitHub
☆14Jan 3, 2025Updated last year
flint-xf-fan / Federated-RLHF
View on GitHub
[AAMAS 2025] Privacy-preserving and Personalized RLHF, with convergence guarantees. The Code contains experiments for training multiple i…
☆16Apr 16, 2025Updated last year
nemori-ai / nemori
View on GitHub
A minimalist MVP demonstrating a simple yet profound insight: aligning AI memory with human episodic memory granularity. Shows how this s…
☆207Apr 16, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
caixd-220529 / LifelongAgentBench
View on GitHub
Code repo for "LifelongAgentBench: Evaluating LLM Agents as Lifelong Learners"
☆93May 30, 2025Updated last year
fuxiAIlab / BYOB
View on GitHub
Build Your Own Bundle-A Neural Combinatorial Optimization Method (BYOB)
☆13Apr 27, 2022Updated 4 years ago
sylvain-wei / TIME
View on GitHub
[NeurIPS 2025 D&B (Spotlight🌟)] TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenario
☆32Oct 5, 2025Updated 9 months ago
RedSearchAgent / DeepTraceHub
View on GitHub
RedSearcher's framework for deep search agent trajectory synthesis, QA filtering, and model evaluation, supporting ReACT and DeepSeek-sty…
☆23Feb 26, 2026Updated 4 months ago
nuster1128 / LLM_Agent_Memory_Survey
View on GitHub
☆502Jul 28, 2025Updated 11 months ago
danny911kr / REALTALK
View on GitHub
Evaluate your agent memory on real-world dialogues, not LLM-simulated dialogues.
☆46Jul 3, 2025Updated last year
MemTensor / HaluMem
View on GitHub
HaluMem is the first operation level hallucination evaluation benchmark tailored to agent memory systems.
☆148Apr 30, 2026Updated 2 months ago