☆107Sep 10, 2025Updated 9 months ago
Alternatives and similar repositories for memory-r1
Users that are interested in memory-r1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆24Jan 16, 2025Updated last year
- MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i…☆23Nov 29, 2025Updated 6 months ago
- Official Implementation for [ICLR26] DefensiveKV: Taming the Fragility of KV Cache Eviction in LLM Inference☆48Mar 28, 2026Updated 2 months ago
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".☆17Sep 15, 2024Updated last year
- ☆37Nov 26, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆12Apr 29, 2024Updated 2 years ago
- ☆14Oct 3, 2024Updated last year
- ☆48Mar 15, 2025Updated last year
- An iOS App simulating Chinese Brush. Mainly using UIGraphics lib. The width of strokes will be based on the speed you draw the strokes.☆13Feb 17, 2014Updated 12 years ago
- ☆18Dec 2, 2024Updated last year
- [NeurIPS 2025 D&B (Spotlight🌟)] TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenario☆32Oct 5, 2025Updated 8 months ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆41Oct 17, 2023Updated 2 years ago
- ☆47Oct 16, 2025Updated 8 months ago
- ☆36Oct 4, 2025Updated 8 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ⚓️ Interactive playground for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper.☆18Dec 20, 2025Updated 5 months ago
- ☆14Jun 24, 2024Updated last year
- ☆29Apr 7, 2024Updated 2 years ago
- Incorporating the memory mechanism into the transformer and employing a parallel weighting structure to obtain a better utterance-level r…☆22Oct 4, 2025Updated 8 months ago
- ☆320Jan 3, 2026Updated 5 months ago
- The repository for papaer "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"☆14Dec 16, 2024Updated last year
- ☆20Jun 17, 2024Updated 2 years ago
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆13Dec 12, 2024Updated last year
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆18Nov 4, 2025Updated 7 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- This repository contains our codebase for the method CABINET that tackles the task of Table Question Answering and achieves state-of-the-…☆13Jul 16, 2024Updated last year
- [EMNLP 2025 Main] SpecVLM: Enhancing Speculative Decoding of Video LLMs via Verifier-Guided Token Pruning☆46Apr 16, 2026Updated 2 months ago
- Image to text recognition for ISBN numbers from books.☆16Dec 8, 2022Updated 3 years ago
- Code for the preprint "Cache Me If You Can: How Many KVs Do You Need for Effective Long-Context LMs?"☆48Jul 29, 2025Updated 10 months ago
- [CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models☆30May 27, 2026Updated 3 weeks ago
- ☆16Nov 26, 2024Updated last year
- [NeurIPS 2025] Reasoning MLLM, Share-GRPO, advantage vanishing, sparse reward☆37Sep 19, 2025Updated 9 months ago
- [ICML 2026 Spotlight] On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models☆156Jun 8, 2026Updated last week
- ☆20Feb 19, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.…☆12Feb 25, 2025Updated last year
- Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination.☆22Jul 18, 2025Updated 11 months ago
- Memory Mosaics are networks of associative memories working in concert to achieve a prediction task.☆62Jan 30, 2025Updated last year
- code for "Deep Learning for Sequential Recommendation: Algorithms, Influential Factors, and Evaluations"☆12Sep 7, 2020Updated 5 years ago
- Ongoing research project for code&math LLMs☆31Jul 4, 2025Updated 11 months ago
- ☆320Jul 10, 2025Updated 11 months ago
- The evaluation framework for training-free sparse attention in LLMs☆123Jan 27, 2026Updated 4 months ago