csitfun / LogiEval
a benchmark suite for testing logical reasoning abilities of prompt-based models
☆30 Updated last year
Alternatives and similar repositories for LogiEval
Users who are interested in LogiEval are comparing it to the repositories listed below.
- the instructions and demonstrations for building a formal logical reasoning capable GLM ☆53 Updated 9 months ago
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement) ☆49 Updated last year
- [NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback ☆41 Updated last year
- Evaluate the Quality of Critique ☆35 Updated last year
- ☆97 Updated last year
- ☆86 Updated 2 years ago
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M… ☆26 Updated last year
- Code and data for paper "Context-faithful Prompting for Large Language Models". ☆40 Updated 2 years ago
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023. ☆63 Updated 6 months ago
- Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context" ☆69 Updated 10 months ago
- Code for reproducing the ACL'23 paper: Don't Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments ☆73 Updated 3 weeks ago
- ☆48 Updated last year
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning ☆36 Updated last year
- [ACL'24] Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements ☆23 Updated 8 months ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint" ☆38 Updated last year
- Supporting code for ReCEval paper ☆28 Updated 8 months ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se… ☆60 Updated 2 years ago
- [EMNLP'24 (Main)] DRPO (Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-… ☆24 Updated 6 months ago
- Generating diverse counterfactual data for Natural Language Understanding tasks using Large Language Models (LLMs). The generator support… ☆36 Updated last year
- This is the repository for paper "CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models" ☆24 Updated last year
- Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey" ☆59 Updated last year
- Repo for outstanding paper@ACL 2023 "Do PLMs Know and Understand Ontological Knowledge?" ☆31 Updated last year
- Complexity Based Prompting for Multi-Step Reasoning ☆17 Updated 2 years ago
- ☆36 Updated last year
- [ACL 2024] The project of Symbol-LLM ☆54 Updated 10 months ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models ☆55 Updated last year
- ☆175 Updated 10 months ago
- The code of Paper "Logic-Driven Context Extension and Data Augmentation for Logical Reasoning of Text". ☆44 Updated 2 years ago
- Resources for the Enigmata Project. ☆32 Updated this week
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study ☆43 Updated 2 years ago