AIS is an evaluation framework for assessing whether the output of natural language models only contains information about the external world that is verifiable in source documents, or "Attributable to Identified Sources".
☆30Jan 14, 2023Updated 3 years ago
Alternatives and similar repositories for AIS
Users that are interested in AIS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/☆25Mar 10, 2025Updated last year
- ☆22Jan 5, 2024Updated 2 years ago
- mReasoner is a unified computational implementation of the model theory of thinking and reasoning☆15Aug 17, 2023Updated 2 years ago
- The InterScript dataset contains interactive user feedback on scripts generated by a T5-XXL model.☆12Dec 15, 2021Updated 4 years ago
- ☆11Nov 27, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Student materials for Stats/Datasci 507, Fall 2021.☆10Dec 10, 2021Updated 4 years ago
- ☆14Apr 29, 2025Updated last year
- Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales"☆12Sep 12, 2023Updated 2 years ago
- TREC Core track☆11Jul 5, 2017Updated 8 years ago
- We believe the ability of an LLM to attribute the text that it generates is likely to be crucial for both system developers and users in …☆55Jul 28, 2023Updated 2 years ago
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals.☆17Apr 25, 2021Updated 5 years ago
- Paper list for the paper "Authorship Attribution in the Era of Large Language Models: Problems, Methodologies, and Challenges (SIGKDD Exp…☆19May 25, 2026Updated 2 weeks ago
- Code and data accompanying the paper "TRUE: Re-evaluating Factual Consistency Evaluation".☆88Updated this week
- Hadoop tools for manipulating ClueWeb collections☆26Jul 15, 2016Updated 9 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆18Oct 6, 2022Updated 3 years ago
- Machine Learning for Authorship Identification using Gutenberg Data☆16Mar 6, 2017Updated 9 years ago
- WorldCuisines is an extensive multilingual and multicultural benchmark that spans 30 languages, covering a wide array of global cuisines.…☆27May 8, 2025Updated last year
- ☆25Oct 22, 2022Updated 3 years ago
- Code for gradient rollback, which explains predictions of neural matrix factorization models, as for example used for knowledge base comp…☆21Mar 16, 2021Updated 5 years ago
- The SMAPH system for query entity linking.☆20Jul 29, 2018Updated 7 years ago
- Code for the article "Shortcutted Commonsense: Data Spuriousness in Deep Learning of Commonsense Reasoning", Outstanding Paper at EMNLP20…☆10Nov 7, 2021Updated 4 years ago
- A paper list of research conducted based on wikiHow☆27Mar 5, 2022Updated 4 years ago
- This is the official implementation for the paper "Learning to Scaffold: Optimizing Model Explanations for Teaching"☆20May 19, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆19Nov 21, 2025Updated 6 months ago
- ☆14Mar 9, 2023Updated 3 years ago
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…☆439Apr 13, 2025Updated last year
- A zero-shot faithfulness evaluation metric for text summarization☆11Oct 17, 2023Updated 2 years ago
- ☆26Mar 20, 2024Updated 2 years ago
- An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Comm…☆10May 9, 2024Updated 2 years ago
- ☆10May 27, 2024Updated 2 years ago
- VERA-MH official repository☆39May 28, 2026Updated last week
- This project studies the performance and robustness of language models and task-adaptation methods.☆154May 18, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆12Mar 4, 2025Updated last year
- ☆17Jul 23, 2025Updated 10 months ago
- A case study approach to successful data science projects using Python pandas and scikit learn☆10Jun 27, 2019Updated 6 years ago
- IPython Notebook for Sentiment Classification☆10Nov 12, 2014Updated 11 years ago
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations"☆29Apr 28, 2023Updated 3 years ago
- Github repo for the Evaluation of ChatGPT Family of Models for Biomedical Reasoning and Classification, code and data | Paper: https://ar…☆14Apr 6, 2023Updated 3 years ago
- Feature Selection using Simulated Annealing☆11Aug 10, 2022Updated 3 years ago