Official repo for SAC3: Reliable Hallucination Detection in Black-Box Language Models via Semantic-aware Cross-check Consistency
☆39Jan 18, 2025Updated last year
Alternatives and similar repositories for sac3
Users that are interested in sac3 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models☆27May 23, 2024Updated 2 years ago
- ☆13Nov 7, 2023Updated 2 years ago
- CodexLeaks: Privacy Leaks from Code Generation Language Models in GitHub Copilot☆11Jul 11, 2023Updated 2 years ago
- UQpy (Uncertainty Quantification with python) is a general purpose Python toolbox for modeling uncertainty in physical and mathematical s…☆359Jun 2, 2026Updated last week
- ☆11Jul 6, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models☆824Jun 5, 2026Updated last week
- Resources for our AAAI 2022 paper: "Unsupervised Editing for Counterfactual Stories".☆12Oct 25, 2022Updated 3 years ago
- [ICLR 2025] FLAT: LLM Unlearning via Loss Adjustment with Only Forget Data☆14Feb 26, 2025Updated last year
- ☆14Oct 17, 2024Updated last year
- List of papers on hallucination detection in LLMs.☆1,101Jun 6, 2026Updated last week
- ☆38Oct 18, 2023Updated 2 years ago
- LSTM-VAE for Time Series Anomaly Detection☆10Feb 21, 2021Updated 5 years ago
- ☆18Apr 2, 2021Updated 5 years ago
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…☆440Apr 13, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for ACL 2022 long paper: Can Prompt Probe Pretrained Language Models? Understanding the Invisible Risks from a Causal View☆10May 17, 2022Updated 4 years ago
- [WNGT(2019)] On the Importance of the Kullback-Leibler Divergence Term in Variational Autoencoders for Text Generation☆11Apr 27, 2022Updated 4 years ago
- The source code of "Empowering Language Understanding with Counterfactual Reasoning" (ACL'21)☆11Sep 3, 2021Updated 4 years ago
- ☆25Jan 11, 2019Updated 7 years ago
- Homotopy type theory cheatsheets☆12Apr 15, 2026Updated 2 months ago
- FlexEval is an LLM evaluation tool designed for practical quantitative analysis.☆16Updated this week
- [ACL 2024] An Easy-to-use Hallucination Detection Framework for LLMs.☆42Feb 25, 2025Updated last year
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆45Jun 13, 2023Updated 3 years ago
- Framework for controlling demographic biases in NLG (using adversarial prompts)☆21Jun 12, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- SHAPR - An AI approach to predict 3D cell shapes from 2D microscopic images☆17May 31, 2023Updated 3 years ago
- exBERT on Transformers🤗☆10Jun 14, 2021Updated 5 years ago
- Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch☆10Aug 7, 2024Updated last year
- Code for the EACL 2024 paper: "Small Language Models Improve Giants by Rewriting Their Outputs"☆12Apr 20, 2024Updated 2 years ago
- An AI educational project disguised as Pac-Man!☆16Jun 2, 2026Updated 2 weeks ago
- ☆16Oct 24, 2023Updated 2 years ago
- Ask to Know More: Counterfactual Explanations for Fake Claims source code☆11Nov 22, 2022Updated 3 years ago
- [TACL] Code for "Red Teaming Language Model Detectors with Language Models"☆24Nov 24, 2023Updated 2 years ago
- Implementation of paper 'Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference' [NeurIPS'24…☆26Jun 14, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Official Implementation of "GEAR: Augmenting Language Models with Generalizable and Efficient Tool Resolution"☆20Apr 3, 2024Updated 2 years ago
- Implementation of an LLM prompting pipeline combined with wrappers for auto-decomposing reasoning steps and for search through the reason…☆16May 7, 2024Updated 2 years ago
- Redis Celery Fabric Gunicorn Personal Blog☆13Aug 8, 2017Updated 8 years ago
- Archer2.0 evolves from its predecessor by introducing ASPO, which overcomes fundamental PPO-Clip limitations to prevent premature converg…☆31Oct 10, 2025Updated 8 months ago
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"☆16Mar 2, 2026Updated 3 months ago
- Code and data for "Medical Dialogue Generation via Dual Flow Modeling" (ACL 2023 Findings)☆14Nov 22, 2023Updated 2 years ago
- Code for Our EMNLP (Industry) 2023 paper "LLM4Vis: Explainable Visualization Recommendation using ChatGPT"☆29Feb 4, 2024Updated 2 years ago