OPPO-PersonalAI / Agent-KBLinks
Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving
☆42Updated this week
Alternatives and similar repositories for Agent-KB
Users that are interested in Agent-KB are comparing it to the libraries listed below
Sorting:
- Open-Source LLM Coders with Co-Evolving Reinforcement Learning☆93Updated last month
- ☆19Updated 4 months ago
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆40Updated 3 weeks ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆19Updated 3 weeks ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆15Updated 4 months ago
- ☆46Updated 2 months ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆38Updated 4 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆63Updated last month
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆46Updated 4 months ago
- Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"☆18Updated last month
- ☆22Updated 7 months ago
- ☆48Updated last month
- ☆47Updated 5 months ago
- RewardAnything: Generalizable Principle-Following Reward Models☆25Updated last month
- ☆16Updated 11 months ago
- ☆24Updated 9 months ago
- Open-Pandora: On-the-fly Control Video Generation☆34Updated 7 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆15Updated this week
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆88Updated last month
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes☆17Updated 3 weeks ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆26Updated 3 months ago
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆28Updated 7 months ago
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver…☆19Updated 7 months ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem”☆18Updated last month
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆20Updated 2 months ago
- This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"☆53Updated 11 months ago
- ☆22Updated last year
- ☆16Updated last week
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆14Updated 4 months ago
- Code for "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆15Updated 3 months ago