Timothyxxx / EnvInteractiveLMPapers
A paper collection of methods that use language to interact with an environment, including the real world, simulated worlds, or the WWW (🌐).
☆127 Updated last year
Alternatives and similar repositories for EnvInteractiveLMPapers
Users who are interested in EnvInteractiveLMPapers are comparing it to the repositories listed below
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference) ☆139 Updated 6 months ago
- Feeling confused about super alignment? Here is a reading list ☆42 Updated last year
- ☆54 Updated last year
- ☆96 Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ☆54 Updated last year
- 🤔ConvRe🤯: An Investigation of LLMs' Inefficacy in Understanding Converse Relations (EMNLP 2023) ☆23 Updated last year
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions ☆110 Updated 8 months ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference) ☆57 Updated 7 months ago
- [ACL 2024] The project of Symbol-LLM ☆54 Updated 10 months ago
- Data and code for the ICLR 2023 paper "Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning". ☆153 Updated last year
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Ziha… ☆123 Updated 11 months ago
- Code for the paper <SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning> ☆49 Updated last year
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning. ☆130 Updated last year
- This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models" ☆49 Updated 6 months ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets" ☆56 Updated 11 months ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se… ☆60 Updated 2 years ago
- ☆41 Updated last year
- ☆59 Updated 8 months ago
- [ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners" ☆109 Updated last year
- Evaluating the Ripple Effects of Knowledge Editing in Language Models ☆55 Updated last year
- Reasoning with Language Model is Planning with World Model ☆165 Updated last year
- ☆61 Updated 2 years ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment" ☆74 Updated 11 months ago
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning ☆36 Updated last year
- ☆74 Updated 11 months ago
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following ☆125 Updated 10 months ago
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement) ☆49 Updated last year
- Paper list on reasoning in NLP ☆189 Updated last month
- ☆25 Updated last year
- Collection of papers for scalable automated alignment. ☆89 Updated 6 months ago