Anni-Zou / Meta-CoT
Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models
☆87Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Meta-CoT
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆48Updated 5 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆49Updated 9 months ago
- ☆112Updated last month
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆124Updated 3 weeks ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆61Updated 4 months ago
- Codebase accompanying the Summary of a Haystack paper.☆72Updated 2 months ago
- ☆116Updated 5 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆41Updated 9 months ago
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"☆92Updated 10 months ago
- ☆63Updated 3 weeks ago
- ☆42Updated 4 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆75Updated last month
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI☆91Updated last year
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆76Updated 9 months ago
- ☆69Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆71Updated 9 months ago
- A set of utilities for running few-shot prompting experiments on large-language models☆113Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆74Updated 10 months ago
- Benchmark baseline for retrieval qa applications☆95Updated 7 months ago
- Repository for paper Tools Are Instrumental for Language Agents in Complex Environments☆33Updated last month
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆73Updated 3 months ago
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆107Updated last year
- [ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning☆179Updated last month
- ☆171Updated 7 months ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆56Updated 8 months ago
- ☆48Updated last year
- Data preparation code for CrystalCoder 7B LLM☆42Updated 6 months ago
- Retrieval Augmented Generation Generalized Evaluation Dataset☆51Updated this week
- Evaluating LLMs with CommonGen-Lite☆85Updated 8 months ago
- This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".☆101Updated last month