JingbiaoMei/ATM-Bench

ATM-Bench: A benchmark for long-term personalized memory QA spanning ~4 years of multimodal data (images, videos, emails). Features referential queries, evidence-grounded answering, and multi-source reasoning. Paper: "According to Me: Long-Term Personalized Referential Memory QA"

☆57

Alternatives and similar repositories for ATM-Bench

Users who are interested in ATM-Bench are comparing it to the repositories listed below.

marco-garosi / CIRCLE
[CVPR Findings 2026] Large Multimodal Models as General In-Context Classifiers
☆24 · Mar 1, 2026 · Updated 4 months ago
jeevanjoseph03 / Linux_Guide
This guide is beginner-friendly, project-driven, and laser-focused on the commands & concepts you will actually use while working with Do…
☆16 · Dec 20, 2025 · Updated 7 months ago
xie-lab-ml / piecewise-sparse-attention
Piecewise Sparse Attention Is Wiser for Efficient Diffusion Transformers
☆32 · Jul 1, 2026 · Updated 2 weeks ago
phymhan / S2D2
☆16 · Jun 17, 2026 · Updated last month
ByteDance-Seed / TaskMem
☆26 · Jun 2, 2026 · Updated last month
Feesuu / MemoryTree
An unofficial implementation of MemTree: From Isolated Conversations to Hierarchical Schemas: Dynamic Tree Memory Representation for LLMs
☆18 · Jul 14, 2025 · Updated last year
Trae1ounG / Pretrain_Space_RLVR
[arxiv: 2604.14142] From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space
☆17 · Apr 16, 2026 · Updated 3 months ago
limenlp / ExeVRM
Official implementation for the paper "Video-Based Reward Modeling for Computer-Use Agents"
☆16 · Mar 14, 2026 · Updated 4 months ago
InternLM / EndoCoT
[ECCV 2026] An official implementation of "EndoCoT". Scaling endogenous Chain-of-Thought (CoT) reasoning in diffusion models for complex …
☆43 · Jun 26, 2026 · Updated 3 weeks ago
laitifranz / MemCoach
[CVPR'26 Highlight] MemCoach: Steering-based MLLM for Actionable Image Memorability Feedback
☆42 · Jul 6, 2026 · Updated 2 weeks ago
MasterVito / DAC-RL
Official Repo for DAC-RL: Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability
☆16 · Feb 26, 2026 · Updated 4 months ago
applese233 / ICRL
In-Context Reinforcement Learning for Tool Use in Large Language Models
☆48 · Mar 26, 2026 · Updated 3 months ago
MYMY-young / DelimScaling
[ICLR 2026] Official implementation of "Enhancing Multi-Image Understanding Through Delimiter Token Scaling"
☆15 · Jul 10, 2026 · Updated last week
ShareLab-SII / CaTok
[CVPR-26] Official repository of "CaTok: Taming Mean Flows for One-Dimensional Causal Image Tokenization"
☆19 · Mar 9, 2026 · Updated 4 months ago
ExplainableML / finer
[CVPR 2026 Oral] FINER: MLLMs Hallucinate under Fine-grained Negative Queries
☆17 · Jul 6, 2026 · Updated 2 weeks ago
Zengwh02 / GlimpRouter
GlimpRouter: Efficient Collaborative Inference by Glimpsing One Token of Thoughts
☆16 · Apr 24, 2026 · Updated 2 months ago
JianhuiWei7 / UniVBench
[CVPR 2026] The official code and datasets for "UniVBench: Towards Unified Evaluation for Video Foundation Models"
☆23 · May 27, 2026 · Updated last month
ModalityDance / MRM
[SIGIR 2026] "One Adapts to Any: Meta Reward Modeling for Personalized LLM Alignment"
☆15 · Apr 21, 2026 · Updated 3 months ago
sinahmr / LocAtViT
PyTorch Implementation of LocAtViT in "Locality-Attending Vision Transformer" (ICLR 2026)
☆18 · Mar 10, 2026 · Updated 4 months ago
AweAI-Team / BeyondSWE
☆47 · Updated this week
klauscc / DAM
Official code for DAM: Dynamic Adapter Merging for Continual Video QA Learning
☆15 · Apr 25, 2024 · Updated 2 years ago
LuckyyySTA / GOLF
☆18 · Mar 16, 2026 · Updated 4 months ago
pickxiguapi / Uni-RLHF-Platform
Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…
☆42 · Nov 20, 2024 · Updated last year
ZBox1005 / CoT-UQ
[ACL 2025] "CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought"
☆17 · Apr 3, 2025 · Updated last year
zjunlp / predict-before-execute
Can We Predict Before Executing Machine Learning Agents?
☆19 · Jul 7, 2026 · Updated 2 weeks ago
Jolieresearch / ICPF
☆14 · Nov 26, 2025 · Updated 7 months ago
viiika / Prism
[ICML 2026] Official Implementation of Prism: Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diff…
☆21 · Mar 4, 2026 · Updated 4 months ago
townsendmerino / ken
Fast hybrid code search for agents. Pure Go, drop-in MCP-compatible with semble.
☆25 · Updated this week
OpenGVLab / Docopilot
[CVPR 2025] Docopilot: Improving Multimodal Models for Document-Level Understanding
☆37 · Jul 22, 2025 · Updated last year
luoxue-star / 4DEquine
4DEquine: Disentangling Motion and Appearance for 4D Equine Reconstruction from Monocular Video (CVPR2026)
☆18 · Apr 12, 2026 · Updated 3 months ago
GreatX3 / ProAct
ProAct is a framework designed to enable Large Language Model (LLM) agents to perform accurate, multi-turn lookahead reasoning in interac…
☆18 · Feb 11, 2026 · Updated 5 months ago
LCM-Lab / Elastic-Attention
Elastic Attention: Test-time Adaptive Sparsity Ratios for Efficient Transformers
☆23 · May 26, 2026 · Updated last month
hainuo-wang / WiT
Official project page and code repository for WiT, a pixel space diffusion
☆17 · May 31, 2026 · Updated last month
SKYLENAGE-AI / DeepVision-103K
Codebase for DeepVision-103K
☆22 · Feb 21, 2026 · Updated 5 months ago
ventr1c / memma
The official repository of "MemMA: Coordinating the Memory Cycle through Multi-Agent Reasoning and In-Situ Self-Evolution".
☆19 · Mar 20, 2026 · Updated 4 months ago
thaoshibe / relsim
🍑 relsim: Relational Visual Similarity | pip install relsim 🌍 (CVPR 2026)
☆85 · Apr 8, 2026 · Updated 3 months ago
horizon-llm / OpenKimi
[ICML2026] Reproduce Kimi K1.5/K2 RL algorithm and rollout system
☆19 · Apr 9, 2026 · Updated 3 months ago
MiniAppBench / miniappbench
Official repository for MiniAppBench. Contains the complete pipeline and codebase for LLM-powered interactive HTML generation and agentic…
☆23 · Mar 9, 2026 · Updated 4 months ago
cvsp-lab / AgilePruner
[ICLR 2026] AgilePruner: An Empirical Study of Attention and Diversity for Adaptive Visual Token Pruning in Large Vision-Language Models
☆28 · Mar 3, 2026 · Updated 4 months ago