Code for the paper <SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning>
☆48Aug 1, 2023Updated 2 years ago
Alternatives and similar repositories for SelfCheck
Users that are interested in SelfCheck are comparing it to the libraries listed below
Sorting:
- ☆26May 30, 2023Updated 2 years ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆270Sep 12, 2024Updated last year
- ☆14Oct 11, 2023Updated 2 years ago
- The Lean Theorem Proving Environment☆14May 7, 2023Updated 2 years ago
- ☆30Dec 27, 2024Updated last year
- The is the official implementation of "Lyra: Orchestrating Dual Correction in Automated Theorem Proving"☆15Jul 2, 2024Updated last year
- The baseline method for CCIR 22 https://www.datafountain.cn/competitions/573☆13Aug 2, 2022Updated 3 years ago
- Source code for ACL 2021 paper "Automatic ICD Coding via Interactive Shared Representation Networks with Self-distillation Mechanism"☆14Jun 1, 2021Updated 4 years ago
- [ NeurIPS 2023 ] Official Codebase for "Aligning Synthetic Medical Images with Clinical Knowledge using Human Feedback"☆19Oct 19, 2023Updated 2 years ago
- This is the artifact for paper “Are Machine Learning Cloud APIs Used Correctly? (#421)” in ICSE2021☆16Feb 27, 2021Updated 5 years ago
- Loop Nest - Linear algebra compiler and code generator.☆20Oct 22, 2022Updated 3 years ago
- VQA-Med 2021☆22Jul 11, 2022Updated 3 years ago
- Release of the ConditionalQA dataset☆21Nov 2, 2021Updated 4 years ago
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆22Nov 26, 2022Updated 3 years ago
- The official code of TACL 2021, "Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies".☆85Oct 31, 2022Updated 3 years ago
- [ACL 2024 Findings] The official repo for "ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large …☆24May 29, 2024Updated last year
- [𝐄𝐌𝐍𝐋𝐏 𝐅𝐢𝐧𝐝𝐢𝐧𝐠𝐬 𝟐𝟎𝟐𝟒 & 𝐀𝐂𝐋 𝟐𝟎𝟐𝟒 𝐍𝐋𝐑𝐒𝐄 𝐎𝐫𝐚𝐥] 𝘌𝘯𝘩𝘢𝘯𝘤𝘪𝘯𝘨 𝘔𝘢𝘵𝘩𝘦𝘮𝘢𝘵𝘪𝘤𝘢𝘭 𝘙𝘦𝘢𝘴𝘰𝘯𝘪𝘯…☆51May 4, 2024Updated last year
- ☆89Apr 2, 2022Updated 3 years ago
- Data and Code for Program of Thoughts [TMLR 2023]☆306May 15, 2024Updated last year
- ☆25Aug 23, 2024Updated last year
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆222Aug 10, 2023Updated 2 years ago
- ⚡Research papers about leveraging the capabilities of language models⚡☆53Jan 13, 2026Updated last month
- Masked Structural Growth for 2x Faster Language Model Pre-training☆25Apr 28, 2024Updated last year
- Source codes for paper ”ReACC: A Retrieval-Augmented Code Completion Framework“☆65Apr 18, 2022Updated 3 years ago
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.☆64Nov 27, 2024Updated last year
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆65Feb 13, 2023Updated 3 years ago
- Implementation of IceFormer: Accelerated Inference with Long-Sequence Transformers on CPUs (ICLR 2024).☆25Feb 22, 2026Updated last week
- ☆47Aug 5, 2025Updated 6 months ago
- Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings.☆31Dec 6, 2023Updated 2 years ago
- The data and implementation for the experiments in the paper "Flows: Building Blocks of Reasoning and Collaborating AI".☆31Feb 12, 2024Updated 2 years ago
- We have released the code and demo program required for LLM with self-verification☆62Oct 18, 2023Updated 2 years ago
- Supporting code for ReCEval paper☆31Sep 14, 2024Updated last year
- ☆39Feb 7, 2025Updated last year
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆31Apr 8, 2024Updated last year
- Internet Programming (CS330)☆11Jan 21, 2026Updated last month
- ZYN: Zero-Shot Reward Models with Yes-No Questions☆35Aug 15, 2023Updated 2 years ago
- ☆72Apr 2, 2024Updated last year
- Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback☆208May 24, 2023Updated 2 years ago
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following☆137Jul 8, 2024Updated last year