shizhediao / automate-cot
Source code for the paper "Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data"
☆20Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for automate-cot
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆36Updated 8 months ago
- [NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback☆37Updated 7 months ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆47Updated 5 months ago
- ☆40Updated 11 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆48Updated 8 months ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆56Updated 8 months ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆77Updated last year
- We have released the code and demo program required for LLM with self-verification☆47Updated last year
- InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆51Updated 3 weeks ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…☆84Updated 4 months ago
- Lightweight tool to identify Data Contamination in LLMs evaluation☆40Updated 8 months ago
- Contrastive Chain-of-Thought Prompting☆53Updated 11 months ago
- This is the repository for paper "CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models"☆22Updated last year
- ☆53Updated 2 months ago
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆50Updated 2 months ago
- Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context"☆61Updated 3 months ago
- Towards Systematic Measurement for Long Text Quality☆28Updated 2 months ago
- ☆17Updated 8 months ago
- ☆43Updated last year
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆56Updated 2 weeks ago
- ☆63Updated 2 years ago
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…☆35Updated this week
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆45Updated 4 months ago
- Evaluate the Quality of Critique☆35Updated 5 months ago
- [ICLR24] The open-source repo of THU-KEG's KoLA benchmark.☆50Updated last year
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.☆61Updated 3 months ago
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆72Updated 3 months ago
- ☆25Updated last month
- Repo for "On Learning to Summarize with Large Language Models as References"☆42Updated last year
- The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agen…☆21Updated 7 months ago