YuxiangChai / A3
☆16 Updated last month
Alternatives and similar repositories for A3:
Users who are interested in A3 are comparing it to the repositories listed below:
- ☆35 Updated last month
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆86 Updated 5 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents" ☆41 Updated last month
- (ICLR 2025) The Official Code Repository for GUI-World. ☆53 Updated 3 months ago
- Official Repository of Are Your LLMs Capable of Stable Reasoning? ☆22 Updated 2 weeks ago
- ☆55 Updated last month
- ☆44 Updated 3 months ago
- [preprint] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA. ☆43 Updated 3 months ago
- Code for Paper: Harnessing Webpage UIs for Text-Rich Visual Understanding ☆50 Updated 3 months ago
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models ☆39 Updated 4 months ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement ☆50 Updated this week
- ☆44 Updated 3 weeks ago
- The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search" ☆20 Updated last week
- ☆59 Updated 3 months ago
- ☆34 Updated 3 months ago
- ☆56 Updated 6 months ago
- ☆13 Updated 3 months ago
- ☆84 Updated last month
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization" ☆59 Updated last month
- ☆16 Updated 5 months ago
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners ☆75 Updated 2 months ago
- MPO: Boosting LLM Agents with Meta Plan Optimization ☆43 Updated 3 weeks ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization ☆31 Updated last month
- ☆43 Updated last month
- ☆51 Updated last month
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks ☆70 Updated 2 weeks ago
- ☆103 Updated 2 months ago
- ☆74 Updated 5 months ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆76 Updated 3 weeks ago
- rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking ☆38 Updated 2 months ago