zjunlp / KnowSelf
[ACL 2025] Agentic Knowledgeable Self-awareness
☆71 · Updated last week
Alternatives and similar repositories for KnowSelf
Users who are interested in KnowSelf are comparing it to the repositories listed below.
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆93 · Updated last week
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆108 · Updated 8 months ago
- ☆45 · Updated last month
- ☆47 · Updated last week
- ☆24 · Updated 9 months ago
- Efficient Agent Training for Computer Use ☆104 · Updated 2 weeks ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning ☆56 · Updated 2 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper ☆38 · Updated 3 months ago
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization" ☆78 · Updated last month
- Process Reward Models That Think ☆41 · Updated 3 weeks ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning? ☆25 · Updated 3 months ago
- The official repo for the code and data of paper SMART ☆26 · Updated 4 months ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent ☆57 · Updated last month
- ☆86 · Updated last month
- Repo for "Z1: Efficient Test-time Scaling with Code" ☆60 · Updated 2 months ago
- ☆32 · Updated last month
- ☆48 · Updated 3 months ago
- ☆38 · Updated 6 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning ☆98 · Updated last month
- RL Scaling and Test-Time Scaling (ICML'25) ☆105 · Updated 5 months ago
- ☆71 · Updated 9 months ago
- Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory ☆62 · Updated 3 weeks ago
- "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents" ☆77 · Updated 2 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso… ☆115 · Updated 3 months ago
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents ☆82 · Updated this week
- Verifiers for LLM Reinforcement Learning ☆60 · Updated 2 months ago
- ☆56 · Updated 6 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples ☆93 · Updated 2 weeks ago
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025) ☆28 · Updated 6 months ago
- ☆65 · Updated 2 months ago