algopapi / RetroformAgent
Langchain Agent finetuning using 7B - LLAMA 2 , on hotpotQA (Retroformer framework)
☆14Updated last year
Related projects ⓘ
Alternatives and complementary repositories for RetroformAgent
- [ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning☆177Updated last month
- [ACL 2024] Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View☆98Updated 5 months ago
- Codes for the EMNLP 2023 Findings paper "Self-Polish: Enhance Reasoning in Large Language Models via Problem Refining" by Zhiheng Xi, Sen…☆27Updated last year
- ☆116Updated 5 months ago
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆109Updated 5 months ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆47Updated 5 months ago
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆72Updated 9 months ago
- A curated paper list on LLM reasoning.☆67Updated 8 months ago
- Official implementation of paper "Cumulative Reasoning With Large Language Models" (https://arxiv.org/abs/2308.04371)☆286Updated last month
- FireAct: Toward Language Agent Fine-tuning☆254Updated last year
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆75Updated 3 weeks ago
- This repository contains a LLM benchmark for the social deduction game `Resistance Avalon'☆82Updated last month
- [COLM'24] Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration☆20Updated 3 weeks ago
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks☆277Updated 3 weeks ago
- Code for the paper <SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning>☆44Updated last year
- [ICLR 2024] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use☆69Updated 7 months ago
- ☆135Updated 6 months ago
- ☆48Updated 8 months ago
- ☆86Updated 3 months ago
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆143Updated 8 months ago
- Source code and demo for memory bank and SiliconFriend☆190Updated last year
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆79Updated this week
- An Analytical Evaluation Board of Multi-turn LLM Agents☆245Updated 5 months ago
- Gentopia Agent Zoo and Agent Benchmark☆28Updated last year
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback☆92Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆48Updated 8 months ago
- ☆102Updated 2 months ago
- This is the official repo for "PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization". PromptAgen…☆199Updated 3 months ago
- Reformatted Alignment☆112Updated last month
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆96Updated last week