younghuman / LLMAgent
☆28Updated last year
Related projects ⓘ
Alternatives and complementary repositories for LLMAgent
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆87Updated last year
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆92Updated last year
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆28Updated 8 months ago
- ☆22Updated 2 months ago
- A re-implementation of Meta-Prompt in LangChain for building self-improving agents.☆60Updated last year
- ☆14Updated last month
- Open Implementations of LLM Analyses☆94Updated last month
- A set of utilities for running few-shot prompting experiments on large-language models☆113Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆82Updated 2 months ago
- ☆48Updated last year
- Based on the tree of thoughts paper☆45Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆71Updated 9 months ago
- ☆51Updated 3 months ago
- About The corresponding code from our paper " REFINER: Reasoning Feedback on Intermediate Representations" (EACL 2024). Do not hesitate t…☆66Updated 9 months ago
- SCREWS: A Modular Framework for Reasoning with Revisions☆26Updated last year
- ☆33Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆49Updated 8 months ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆76Updated 9 months ago
- Evaluating LLMs with CommonGen-Lite☆85Updated 8 months ago
- ☆112Updated last month
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆48Updated 5 months ago
- ☆103Updated 3 months ago
- KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents☆173Updated last month
- ☆35Updated last year
- Beating the GAIA benchmark with Transformers Agents. 🚀☆62Updated 3 weeks ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- Evaluating tool-augmented LLMs in conversation settings☆72Updated 5 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆41Updated 9 months ago
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆155Updated 6 months ago
- ☆116Updated 5 months ago