thunlp / ProactiveAgent
An LLM-based agent that predicts its tasks proactively.
☆16, updated 2 months ago
Related projects
Alternatives and complementary repositories for ProactiveAgent
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents ☆46, updated 2 weeks ago
- Towards Large Multimodal Models as Visual Foundation Agents ☆122, updated last week
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆75, updated last month
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks ☆25, updated last month
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment" ☆68, updated 5 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference) ☆99, updated 3 weeks ago
- Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs ☆42, updated 4 months ago
- MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation ☆17, updated 2 weeks ago
- MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models ☆18, updated last month
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples ☆39, updated last month
- A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents ☆31, updated this week
- The official implementation of Self-Exploring Language Models (SELM) ☆55, updated 5 months ago
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models ☆14, updated 4 months ago
- Official repository for Decentralized Arena via Collective LLM Intelligence ☆8, updated last month
- Official repository of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models" ☆39, updated 4 months ago
- Evaluation framework for the paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?" ☆47, updated last month
- The official code repository for GUI-World ☆41, updated 3 months ago
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073 ☆26, updated 4 months ago
- 🎮 Manipulates mobile phones just like how you would. Official code for "MobA: A Two-Level Agent System for Efficient Mobile Task Automati… ☆13, updated 2 weeks ago
- [ICML 2024 Oral] A framework for society simulation that supports complex simulation, for example: multi-scene. ☆52, updated 3 months ago
- Official repository of the paper "OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI" ☆86, updated last month
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios ☆46, updated 7 months ago
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme… ☆12, updated last week