algopapi / RetroformAgent
LangChain agent fine-tuning using the 7B Llama 2 model on HotpotQA (Retroformer framework)
☆15Updated last year
Alternatives and similar repositories for RetroformAgent:
Users who are interested in RetroformAgent are comparing it to the libraries listed below:
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆98Updated last year
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆132Updated 9 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆213Updated 2 months ago
- ☆101Updated 3 months ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆51Updated 9 months ago
- Codes for the EMNLP 2023 Findings paper "Self-Polish: Enhance Reasoning in Large Language Models via Problem Refining" by Zhiheng Xi, Sen…☆30Updated last year
- FireAct: Toward Language Agent Fine-tuning☆271Updated last year
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆127Updated 4 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆76Updated last week
- ☆176Updated last month
- This repository contains an LLM benchmark for the social deduction game "Resistance Avalon"☆96Updated last month
- [ACL 2024] Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View☆112Updated 9 months ago
- ☆120Updated 9 months ago
- rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking☆37Updated 2 months ago
- ☆42Updated 2 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆58Updated 3 weeks ago
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System"☆52Updated 3 months ago
- Awesome LLM papers, news and projects about learning to reason with LLM, OpenAI o1, reasoning techniques, chain-of-thought (CoT), Large …☆24Updated 5 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- ☆54Updated 6 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆99Updated 4 months ago
- Gentopia Agent Zoo and Agent Benchmark☆30Updated last year
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆84Updated 4 months ago
- Syntax Error-Free and Generalizable Tool Use for LLMs via Finite-State Decoding☆27Updated last year
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆92Updated last year
- Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf)☆53Updated 7 months ago
- An implementation of Everything of Thoughts (XoT).☆140Updated last year
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior…☆11Updated last year
- ☆101Updated last month
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆174Updated 5 months ago