Code for the paper <SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning>
☆47Aug 1, 2023Updated 2 years ago
Alternatives and similar repositories for SelfCheck
Users that are interested in SelfCheck are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The baseline method for CCIR 22 https://www.datafountain.cn/competitions/573☆13Aug 2, 2022Updated 3 years ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆270Sep 12, 2024Updated last year
- ☆11Nov 27, 2022Updated 3 years ago
- ☆26May 30, 2023Updated 2 years ago
- The is the official implementation of "Lyra: Orchestrating Dual Correction in Automated Theorem Proving"☆15Jul 2, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆30Dec 27, 2024Updated last year
- A dataset for natural language code search.☆14Feb 13, 2020Updated 6 years ago
- This is the artifact for paper “Are Machine Learning Cloud APIs Used Correctly? (#421)” in ICSE2021☆16Feb 27, 2021Updated 5 years ago
- [ NeurIPS 2023 ] Official Codebase for "Aligning Synthetic Medical Images with Clinical Knowledge using Human Feedback"☆20Oct 19, 2023Updated 2 years ago
- ☆14Oct 11, 2023Updated 2 years ago
- The official code of TACL 2021, "Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies".☆85Oct 31, 2022Updated 3 years ago
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆223Aug 10, 2023Updated 2 years ago
- Guidelines for our secondary layer of annotation adding multi-sentence AMR links☆12Sep 6, 2017Updated 8 years ago
- ☆13Oct 4, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- About The corresponding code from our paper " REFINER: Reasoning Feedback on Intermediate Representations" (EACL 2024). Do not hesitate t…☆74Jan 27, 2026Updated 3 months ago
- Source code for ACL 2021 paper "Automatic ICD Coding via Interactive Shared Representation Networks with Self-distillation Mechanism"☆14Jun 1, 2021Updated 4 years ago
- Code and models for ``Answering Open-Domain Multi-Answer Questions via a Recall-then-Verify Framework (ACL 2022)''☆12Jun 29, 2022Updated 3 years ago
- ☆25Aug 23, 2024Updated last year
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.☆64Nov 27, 2024Updated last year
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆67Feb 13, 2023Updated 3 years ago
- Supporting code for ReCEval paper☆32Sep 14, 2024Updated last year
- [NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind☆66Dec 21, 2023Updated 2 years ago
- Repo to reproduce the First-Explore paper results☆39Dec 25, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback☆208May 24, 2023Updated 2 years ago
- Code and data accompanying the paper "TRUE: Re-evaluating Factual Consistency Evaluation".☆87Apr 1, 2026Updated last month
- The official repository of "ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models"☆46Jun 2, 2023Updated 2 years ago
- Automated Machine Learning (AutoML) for Kaggle Competition☆32Jul 6, 2023Updated 2 years ago
- [𝐄𝐌𝐍𝐋𝐏 𝐅𝐢𝐧𝐝𝐢𝐧𝐠𝐬 𝟐𝟎𝟐𝟒 & 𝐀𝐂𝐋 𝟐𝟎𝟐𝟒 𝐍𝐋𝐑𝐒𝐄 𝐎𝐫𝐚𝐥] 𝘌𝘯𝘩𝘢𝘯𝘤𝘪𝘯𝘨 𝘔𝘢𝘵𝘩𝘦𝘮𝘢𝘵𝘪𝘤𝘢𝘭 𝘙𝘦𝘢𝘴𝘰𝘯𝘪𝘯…☆51May 4, 2024Updated 2 years ago
- PathPiece tokenizer☆14Nov 10, 2024Updated last year
- ☆14Jul 24, 2024Updated last year
- Code for AAAI 2023 accepted paper titled "Knowledge-Bridged Causal Interaction Network for Causal Emotion Entailment"☆14May 6, 2023Updated 2 years ago
- {DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is}☆14Jun 18, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 中国执业医师、药师、护士资格考试数据集和ChatGPT评估☆14Mar 13, 2026Updated last month
- Adapt MLLMs to Domains via Post-Training (EMNLP 2025 Findings)☆14Nov 11, 2025Updated 5 months ago
- A minimal language for Isabelle/HOL, designed for easing machine learning.☆27Jan 13, 2026Updated 3 months ago
- [APSIPA ASC 2023] The official code of paper, "FactLLaMA: Optimizing Instruction-Following Language Models with External Knowledge for Au…☆16Mar 7, 2024Updated 2 years ago
- [NeurIPS 2024] "Mind the Gap between Prototypes and Images in Cross-domain Finetuning"☆11Nov 15, 2024Updated last year
- This is a pytorch implementation of our Recurrent Aggregation of Multimodal Embeddings Network (RAMEN) from our CVPR-2019 paper.☆17Apr 5, 2020Updated 6 years ago
- Loop Nest - Linear algebra compiler and code generator.☆20Oct 22, 2022Updated 3 years ago