We have released the code and demo program for LLMs with self-verification
☆62 · Oct 18, 2023 · Updated 2 years ago
Alternatives and similar repositories for Self-Verification
Users that are interested in Self-Verification are comparing it to the libraries listed below
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie… ☆10 · Feb 7, 2026 · Updated 3 weeks ago
- 🤡 An up-to-date & curated list of awesome KBQA papers, methods & resources. ☆10 · Jul 14, 2022 · Updated 3 years ago
- Grounding Language Models for Compositional and Spatial Reasoning ☆18 · Oct 26, 2022 · Updated 3 years ago
- PACIFIC: Towards Proactive Conversational Question Answering over Tabular and Textual Data in Finance ☆14 · May 15, 2024 · Updated last year
- RiddleSense: Reasoning about Riddle Questions Featuring Linguistic Creativity and Commonsense Knowledge ☆14 · Oct 20, 2021 · Updated 4 years ago
- Data and Code for Program of Thoughts [TMLR 2023] ☆306 · May 15, 2024 · Updated last year
- ☆37 · Oct 29, 2024 · Updated last year
- ☆68 · Jun 27, 2022 · Updated 3 years ago
- Llemma formal2formal (tactic prediction) theorem proving experiments ☆20 · Oct 17, 2023 · Updated 2 years ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features" ☆17 · Mar 31, 2025 · Updated 11 months ago
- LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively. ☆784 · Oct 4, 2024 · Updated last year
- This is the official repository for our NeurIPS'22 paper "Watermarking for Out-of-distribution Detection." ☆18 · Feb 24, 2023 · Updated 3 years ago
- Official Implementation for PlugIn Inversion ☆16 · Oct 23, 2021 · Updated 4 years ago
- [AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing (https://arxiv.org/abs/2401.09003) ☆23 · Oct 2, 2025 · Updated 5 months ago
- Official implementation of "Beyond Theorem Proving: Formulation, Framework and Benchmark for Formal Problem-Solving" ☆29 · May 8, 2025 · Updated 9 months ago
- ACL 2021 - Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble ☆18 · Jun 12, 2023 · Updated 2 years ago
- ☆21 · Mar 4, 2024 · Updated last year
- A Pytorch implementation of "Data-Free Learning of Student Networks" (ICCV 2019). ☆18 · Oct 8, 2019 · Updated 6 years ago
- ☆139 · Nov 3, 2023 · Updated 2 years ago
- ☆49 · Aug 29, 2023 · Updated 2 years ago
- 🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023) ☆24 · Oct 10, 2023 · Updated 2 years ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets" ☆61 · Jun 3, 2024 · Updated last year
- Code for the paper <SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning> ☆48 · Aug 1, 2023 · Updated 2 years ago
- Source code for the paper "Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data" ☆20 · Feb 24, 2024 · Updated 2 years ago
- [NeurIPS 2021] “Improving Contrastive Learning on Imbalanced Data via Open-World Sampling”, Ziyu Jiang, Tianlong Chen, Ting Chen, Zhangya… ☆29 · Dec 30, 2021 · Updated 4 years ago
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025. ☆28 · Feb 17, 2025 · Updated last year
- A framework for editing the CoTs for better factuality ☆50 · Dec 9, 2023 · Updated 2 years ago
- ☆24 · Feb 2, 2026 · Updated last month
- ☆25 · Jun 23, 2021 · Updated 4 years ago
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings ☆12 · Jun 28, 2022 · Updated 3 years ago
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving* ☆120 · Dec 10, 2024 · Updated last year
- code for the ICLR'22 paper: On Robust Prefix-Tuning for Text Classification ☆27 · Mar 21, 2022 · Updated 3 years ago
- ☆25 · Mar 28, 2025 · Updated 11 months ago
- Evaluation on Logical Reasoning and Abstract Reasoning Challenges ☆29 · Apr 21, 2025 · Updated 10 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch ☆28 · Feb 9, 2026 · Updated 3 weeks ago
- Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" on Findings of NAACL 2022 ☆32 · Jul 11, 2022 · Updated 3 years ago
- a benchmark suite for testing logical reasoning abilities of prompt-based models ☆32 · Nov 20, 2023 · Updated 2 years ago
- [ACL 2021] Learning Relation Alignment for Calibrated Cross-modal Retrieval ☆34 · May 16, 2023 · Updated 2 years ago
- Flowlyt is a security analyzer that scans GitHub Actions workflows to detect malicious patterns, misconfigurations, and secrets exposure,… ☆15 · Updated this week