zjunlp / WorfBench
[ICLR 2025] Benchmarking Agentic Workflow Generation
☆50 Updated last week
Alternatives and similar repositories for WorfBench:
Users who are interested in WorfBench are comparing it to the repositories listed below.
- [NeurIPS 2024] Agent Planning with World Knowledge Model ☆114 Updated 2 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference) ☆121 Updated 4 months ago
- ☆99 Updated 2 months ago
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios ☆53 Updated 11 months ago
- [preprint] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA. ☆42 Updated 2 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs. ☆96 Updated 4 months ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference) ☆54 Updated 4 months ago
- official implementation of paper "Process Reward Model with Q-value Rankings" ☆49 Updated 3 weeks ago
- ☆35 Updated 2 months ago
- This is the implementation of LeCo ☆32 Updated last month
- ☆45 Updated 4 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024] ☆125 Updated 3 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction ☆59 Updated this week
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System" ☆52 Updated 3 months ago
- Code for NeurIPS 2024 paper "AutoManual: Constructing Instruction Manuals by LLM Agents via Interactive Environmental Learning" ☆35 Updated 3 months ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning ☆44 Updated 3 months ago
- The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agen… ☆24 Updated 11 months ago
- ☆41 Updated 4 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ☆54 Updated last year
- ☆54 Updated 5 months ago
- Reformatted Alignment ☆114 Updated 5 months ago
- ☆98 Updated last month
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆84 Updated 4 months ago
- ☆42 Updated 2 months ago
- Code implementation of synthetic continued pretraining ☆91 Updated last month
- ☆13 Updated last year
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents ☆75 Updated 2 weeks ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets" ☆50 Updated 9 months ago