☆101Sep 10, 2025Updated 7 months ago
Alternatives and similar repositories for memory-r1
Users that are interested in memory-r1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official implementation of the paper "Mem-α: Learning Memory Construction via Reinforcement Learning"☆200Dec 25, 2025Updated 4 months ago
- ☆23Jan 16, 2025Updated last year
- Official Implementation for [ICLR26] DefensiveKV: Taming the Fragility of KV Cache Eviction in LLM Inference☆43Mar 28, 2026Updated last month
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".☆17Sep 15, 2024Updated last year
- ☆35Nov 26, 2025Updated 5 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆12Apr 29, 2024Updated 2 years ago
- TOKEN-IMPORTANCE GUIDED DIRECT PREFERENCE OPTIMIZATION☆35Jan 26, 2026Updated 3 months ago
- ☆10Jul 19, 2021Updated 4 years ago
- An official PyTorch implementation of "Certifiably Robust Graph Contrastive Learning" (NeurIPS 2023)☆11Jan 22, 2024Updated 2 years ago
- ☆47Mar 15, 2025Updated last year
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆24Apr 30, 2025Updated last year
- An iOS App simulating Chinese Brush. Mainly using UIGraphics lib. The width of strokes will be based on the speed you draw the strokes.☆13Feb 17, 2014Updated 12 years ago
- ☆18Dec 2, 2024Updated last year
- [NeurIPS 2025 D&B (Spotlight🌟)] TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenario☆31Oct 5, 2025Updated 7 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- This repository contains data of TruthSocial posts related to the 2024 U.S. Elections☆12Nov 1, 2024Updated last year
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning [ICLR26]☆64Apr 11, 2026Updated 3 weeks ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆41Oct 17, 2023Updated 2 years ago
- ☆42Oct 16, 2025Updated 6 months ago
- ☆34Oct 4, 2025Updated 7 months ago
- ☆14Jun 24, 2024Updated last year
- ☆29Apr 7, 2024Updated 2 years ago
- Offical implementation of our paper "Exploring the Potential of Diffusion Large Language Models in Code Generation".☆21Oct 29, 2025Updated 6 months ago
- ☆311Jan 3, 2026Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- The Official Implementation of Ada-KV [NeurIPS 2025]☆132Nov 26, 2025Updated 5 months ago
- The repository for papaer "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"☆14Dec 16, 2024Updated last year
- ☆20Jun 17, 2024Updated last year
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆13Dec 12, 2024Updated last year
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆18Nov 4, 2025Updated 6 months ago
- The code repository for "Task-Agnostic Guided Feature Expansion for Class-Incremental Learning" (CVPR25)☆27Dec 31, 2025Updated 4 months ago
- [EMNLP 2025 Main] SpecVLM: Enhancing Speculative Decoding of Video LLMs via Verifier-Guided Token Pruning☆41Apr 16, 2026Updated 3 weeks ago
- An adaptive sampling framework for Reinforce-style LLM post training.☆95Nov 29, 2025Updated 5 months ago
- Code for the preprint "Cache Me If You Can: How Many KVs Do You Need for Effective Long-Context LMs?"☆48Jul 29, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models☆31Mar 18, 2026Updated last month
- ☆12Sep 12, 2024Updated last year
- Gaussian Embedding of Large-scale Attributed Graphs☆10Mar 13, 2020Updated 6 years ago
- ☆16Nov 26, 2024Updated last year
- Code for verifying deep neural feature ansatz☆22May 3, 2023Updated 3 years ago
- Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination.☆22Jul 18, 2025Updated 9 months ago
- MARNNs Can Learn Generalized Dyck Languages☆12Nov 11, 2019Updated 6 years ago