yansikuan/memory-r1

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yansikuan/memory-r1)

yansikuan / memory-r1

☆109

Alternatives and similar repositories for memory-r1

Users that are interested in memory-r1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wangyu-ustc / Mem-alpha
View on GitHub
The official implementation of the paper "Mem-α: Learning Memory Construction via Reinforcement Learning"
☆212Dec 25, 2025Updated 6 months ago
FFY0 / DefensiveKV
View on GitHub
Official Implementation for [ICLR26] DefensiveKV: Taming the Fragility of KV Cache Eviction in LLM Inference
☆51Mar 28, 2026Updated 3 months ago
66RING / CritiPrefill
View on GitHub
Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".
☆17Sep 15, 2024Updated last year
NovaSky-AI / SkyRL-OpenHands
View on GitHub
☆37Nov 26, 2025Updated 7 months ago
huangyuxiang03 / Locret
View on GitHub
☆14Oct 3, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
gracefulning / TIDPO
View on GitHub
TOKEN-IMPORTANCE GUIDED DIRECT PREFERENCE OPTIMIZATION
☆37Jan 26, 2026Updated 5 months ago
TemporaryLoRA / Block-Attention
View on GitHub
☆48Mar 15, 2025Updated last year
GSYfate / knnlm-limits
View on GitHub
Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"
☆24Apr 30, 2025Updated last year
namespace-Pt / UltraGist
View on GitHub
☆18Dec 2, 2024Updated last year
kashish-s / TruthSocial_2024ElectionInitiative
View on GitHub
This repository contains data of TruthSocial posts related to the 2024 U.S. Elections
☆12Nov 1, 2024Updated last year
TIGER-AI-Lab / Hierarchical-Reasoner
View on GitHub
Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning [ICLR26]
☆64Apr 11, 2026Updated 2 months ago
Sharut / CARE
View on GitHub
☆18Jun 28, 2023Updated 3 years ago
gmftbyGMFTBY / Rep-Dropout
View on GitHub
[NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective
☆41Oct 17, 2023Updated 2 years ago
UCSB-NLP-Chang / KVLink
View on GitHub
☆47Oct 16, 2025Updated 8 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
mozhu621 / LongGenBench
View on GitHub
☆37Oct 4, 2025Updated 9 months ago
RUCBM / ICLEval
View on GitHub
☆14Jun 24, 2024Updated 2 years ago
Pbihao / SLM
View on GitHub
☆29Apr 7, 2024Updated 2 years ago
zhangyitonggg / dllm4code
View on GitHub
Offical implementation of our paper "Exploring the Potential of Diffusion Large Language Models in Code Generation".
☆23Oct 29, 2025Updated 8 months ago
luckyerr / Voice-Transformer_Speaker-Verification
View on GitHub
Incorporating the memory mechanism into the transformer and employing a parallel weighting structure to obtain a better utterance-level r…
☆22Oct 4, 2025Updated 9 months ago
MIT-MI / MEM1
View on GitHub
☆324Jan 3, 2026Updated 6 months ago
runchu-tian / LongPiBench
View on GitHub
The repository for papaer "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"
☆14Dec 16, 2024Updated last year
pangjh3 / AnLLM
View on GitHub
☆20Jun 17, 2024Updated 2 years ago
domaineval / DomainEval
View on GitHub
DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …
☆13Dec 12, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
GATECH-EIC / LaCache
View on GitHub
[ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models
☆17Nov 4, 2025Updated 8 months ago
bwnzheng / TagFex_CVPR2025
View on GitHub
The code repository for "Task-Agnostic Guided Feature Expansion for Class-Incremental Learning" (CVPR25)
☆28Dec 31, 2025Updated 6 months ago
JZCS2018 / SMAT
View on GitHub
Model and datasets for schema matching
☆15Jul 17, 2021Updated 4 years ago
zju-jiyicheng / SpecVLM
View on GitHub
[EMNLP 2025 Main] SpecVLM: Enhancing Speculative Decoding of Video LLMs via Verifier-Guided Token Pruning
☆48Apr 16, 2026Updated 2 months ago
RLHFlow / Reinforce-Ada
View on GitHub
An adaptive sampling framework for Reinforce-style LLM post training.
☆96Nov 29, 2025Updated 7 months ago
xuyang-liu16 / V2Drop
View on GitHub
[CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models
☆32May 27, 2026Updated last month
eecshope / HITS
View on GitHub
☆13Sep 12, 2024Updated last year
YihongDong / CDD-TED4LLMs
View on GitHub
☆16Nov 26, 2024Updated last year
youngmihuang / awesome-casual-inference
View on GitHub
☆23Apr 26, 2021Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
nishadsinghi / sc-genrm-scaling
View on GitHub
[COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…
☆15Oct 31, 2025Updated 8 months ago
machuangtao / KG-RAG4SM
View on GitHub
Knowledge Graph-based Retrieval-Augmented Generation for Schema Matching
☆17Jun 17, 2026Updated 2 weeks ago
Interplay-LM-Reasoning / Interplay-LM-Reasoning
View on GitHub
[ICML 2026 Spotlight] On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models
☆160Jun 8, 2026Updated last month
aradha / agop_feature_learning
View on GitHub
☆21Feb 19, 2024Updated 2 years ago
megagonlabs / holobench
View on GitHub
🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.…
☆12Feb 25, 2025Updated last year
facebookresearch / MemoryMosaics
View on GitHub
Memory Mosaics are networks of associative memories working in concert to achieve a prediction task.
☆63Jan 30, 2025Updated last year
sttich / dl-recommendation
View on GitHub
code for "Deep Learning for Sequential Recommendation: Algorithms, Influential Factors, and Evaluations"
☆12Sep 7, 2020Updated 5 years ago