ZhangYiqun018 / AvengersProLinks
☆169Updated last month
Alternatives and similar repositories for AvengersPro
Users that are interested in AvengersPro are comparing it to the libraries listed below
Sorting:
- Data Synthesis for Deep Research Based on Semi-Structured Data☆165Updated 2 weeks ago
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆169Updated this week
- CursorCore: Assist Programming through Aligning Anything☆131Updated 7 months ago
- LIMI: Less is More for Agency☆112Updated 2 weeks ago
- Code for the paper "Coding Agents with Multimodal Browsing are Generalist Problem Solvers"☆85Updated 3 weeks ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆100Updated last month
- [ACL 2025] Agentic Knowledgeable Self-awareness☆82Updated 3 months ago
- Pivotal Token Search☆126Updated 2 months ago
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?☆181Updated this week
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆106Updated 3 months ago
- ☆60Updated 10 months ago
- ☆293Updated 4 months ago
- ☆57Updated 7 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆262Updated last month
- Official code repository for Sketch-of-Thought (SoT)☆128Updated 4 months ago
- Qwen3Guard is a multilingual guardrail model series developed by the Qwen team at Alibaba Cloud.☆232Updated last week
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆84Updated 4 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆93Updated 5 months ago
- An Automatic Prompt Optimization Framework for Large Language Models☆122Updated 2 months ago
- ☆170Updated 7 months ago
- II-Thought-RL is our initial attempt at developing a large-scale, multi-domain Reinforcement Learning (RL) dataset☆28Updated 6 months ago
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM.☆61Updated 3 months ago
- accompanying material for sleep-time compute paper☆115Updated 5 months ago
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆186Updated 2 weeks ago
- An open platform for enhancing the capability of LLMs in workflow orchestration.☆172Updated 6 months ago
- ☆232Updated 3 months ago
- ☆58Updated 4 months ago
- All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.☆462Updated last week
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆89Updated 4 months ago
- ☆70Updated this week