Implementation of Evo-Memory style learning for LLM agents. Agents learn from outcomes, refine strategies, and get smarter with every task. 🚀 Features: Experience-driven memory architecture Semantic search + context synthesis Self-improving agents
☆46Dec 3, 2025Updated 4 months ago
Alternatives and similar repositories for Evo-Memory
Users that are interested in Evo-Memory are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The source code of Mem-Gallery: Benchmarking Multimodal Long-Term Conversational Memory for MLLM Agents.☆43Jan 31, 2026Updated 2 months ago
- REverse-Engineered Reasoning for Open-Ended Generation☆94Sep 10, 2025Updated 7 months ago
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"?☆38Jun 23, 2025Updated 9 months ago
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.☆24Sep 4, 2024Updated last year
- SimKO: Simple Pass@K Policy Optimization☆28Oct 24, 2025Updated 5 months ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code for MemoryBench☆57Dec 23, 2025Updated 3 months ago
- Confidence Regulation Neurons in Language Models (NeurIPS 2024)☆15Feb 1, 2025Updated last year
- Custom launcher for Claude Code, supporting dynamic prompts, layered configuration and easy custom hooks and MCPs.☆16Updated this week
- Using deep research workflow to generate datasets for finetuning LLMs.☆39Oct 9, 2025Updated 6 months ago
- Linux programming environment course in Chinese☆12Nov 19, 2017Updated 8 years ago
- ☆15Nov 28, 2023Updated 2 years ago
- ☆68Apr 7, 2026Updated last week
- Document your Talon scripts using Sphinx.☆16Updated this week
- 强化学习课程,主要是如何用强化学习解决问题☆15Dec 10, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization☆22Apr 8, 2026Updated last week
- Agent Memory Playground: AI Agent Memory Design & Optimization Techniques☆35Aug 7, 2025Updated 8 months ago
- RapidIn: Scalable Influence Estimation for Large Language Models (LLMs). The implementation for paper "Token-wise Influential Training Da…☆21Mar 10, 2026Updated last month
- Self-improving AI agents using Agentic Context Engineering - A starter implementation with Google ADK☆21Oct 23, 2025Updated 5 months ago
- A general framework used on evaluating the performance of large language models (LLMs) based on the peer review mechanism among LLMs☆19Aug 3, 2024Updated last year
- [ICLR 2025] Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron☆30Apr 30, 2025Updated 11 months ago
- [ICLR 2025] Official implementation of paper "Dynamic Low-Rank Sparse Adaptation for Large Language Models".☆24Mar 16, 2025Updated last year
- Curated systems, benchmarks, and papers etc. on memory for LLMs/MLLMs --- long-term context, retrieval, and reasoning.☆341Updated this week
- MultiArchKernelBench: A Multi-Platform Benchmark for Kernel Generation☆52Mar 25, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A Survey of Multimodal Retrieval-Augmented Generation☆20Nov 3, 2025Updated 5 months ago
- A tool for managing compliance as code in your GitHub repositories.☆28Apr 7, 2026Updated last week
- AGiXT is a dynamic AI Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse A…☆24Jan 26, 2026Updated 2 months ago
- ☆21Oct 1, 2024Updated last year
- ☆42Feb 12, 2026Updated 2 months ago
- HaluMem is the first operation level hallucination evaluation benchmark tailored to agent memory systems.☆123Jan 8, 2026Updated 3 months ago
- This repo contains code and data for ICLR 2025 paper MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs☆38Mar 9, 2025Updated last year
- Implementation of "ACL'24: When Do LLMs Need Retrieval Augmentation? Mitigating LLMs’ Overconfidence Helps Retrieval Augmentation"☆24Jul 19, 2024Updated last year
- Some LaTeX Tips for Writing Research Papers☆10May 30, 2016Updated 9 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [ICLR 2025 Spotlight] Official PyTorch Implementation of "BodyGen: Advancing Towards Efficient Embodiment Co-Design"☆52Oct 21, 2025Updated 5 months ago
- Dr. MAS is an end-to-end RL training framework for multi-agent LLM systems, supporting the co-training of multiple (heterogeneous) LLMs.☆121Apr 1, 2026Updated 2 weeks ago
- [NeurIPS 2025 D&B Track] MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research☆26Sep 23, 2025Updated 6 months ago
- ☆27Jan 8, 2024Updated 2 years ago
- The official repo for our paper: LegalAgentBench: Evaluating LLM Agents in Legal Domainl☆43Dec 30, 2024Updated last year
- ☆20Apr 6, 2025Updated last year
- ☆47Mar 15, 2025Updated last year