AuditNLG: Auditing Generative AI Language Modeling for Trustworthiness
☆103Jun 2, 2026Updated last week
Alternatives and similar repositories for AuditNLG
Users who are interested in AuditNLG are comparing it to the libraries listed below.
- A curated list of resources dedicated to NLP (papers, blogs, notes, etc.)☆13Nov 30, 2019Updated 6 years ago
- The official implementation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).☆44Dec 25, 2022Updated 3 years ago
- Open source library for few shot NLP☆79Jun 2, 2026Updated last week
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16May 3, 2022Updated 4 years ago
- BossNet: Disentangling Language and Knowledge in Task Oriented Dialogs☆16Dec 8, 2022Updated 3 years ago
- ☆24Jun 12, 2023Updated 2 years ago
- Commonsense Explanations for Commonsense Question Answering☆13Jun 27, 2019Updated 6 years ago
- FaVIQ: Fact Verification from Information-seeking Questions☆43Nov 23, 2022Updated 3 years ago
- Provides functionality for converting 모두의 말뭉치 (Modu Corpus) data into a form convenient for analysis.☆11Mar 2, 2022Updated 4 years ago
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"☆14Aug 19, 2022Updated 3 years ago
- ☆26May 9, 2022Updated 4 years ago
- ☆11Jul 6, 2024Updated last year
- A utility for storing and reading files for Korean LM training 💾☆35Oct 15, 2025Updated 7 months ago
- Deep neural parser for database query☆18Nov 20, 2022Updated 3 years ago
- ☆33Jun 2, 2026Updated last week
- ☆62Aug 23, 2023Updated 2 years ago
- Code and data for Koo et al.'s ACL 2024 paper "Benchmarking Cognitive Biases in Large Language Models as Evaluators"☆23Feb 16, 2024Updated 2 years ago
- A first cut into exploring the use of dependency links for building Text Graphs, that, among other things, with help of a centrality algo…☆32Oct 20, 2023Updated 2 years ago
- A dataset for realistic evaluation of noisy label methods☆15Dec 3, 2023Updated 2 years ago
- The goal of this experiment is to take articles and certain metadata and group them by topic.☆11Apr 14, 2016Updated 10 years ago
- PromptCraft is a prompt perturbation toolkit that operates at the character, word, and sentence levels for prompt robustness analysis. PyPI Package: …☆24Jan 3, 2024Updated 2 years ago
- [ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction☆13Apr 21, 2020Updated 6 years ago
- ☆12Apr 18, 2019Updated 7 years ago
- Token-level Reference-free Hallucination Detection☆98Jul 25, 2023Updated 2 years ago
- AIR retriever for Multi-Hop QA (ACL 2020 paper)☆30Jul 18, 2020Updated 5 years ago
- ☆18Sep 15, 2025Updated 8 months ago
- ☆40Jun 2, 2026Updated last week
- PyTorch reimplementation of REALM and ORQA☆22Feb 3, 2022Updated 4 years ago
- ☆54Jan 18, 2023Updated 3 years ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Aug 25, 2023Updated 2 years ago
- ☆17May 19, 2023Updated 3 years ago
- Code and data for paper "Context-faithful Prompting for Large Language Models".☆41Mar 23, 2023Updated 3 years ago
- Code for the paper InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning☆101May 6, 2023Updated 3 years ago
- Code for bidirectional sequence generation (BiSon) for generating from BERT pre-trained models.☆51Mar 17, 2020Updated 6 years ago
- FRANK: Factuality Evaluation Benchmark☆59Dec 13, 2022Updated 3 years ago
- Official Repo for CRMArena and CRMArena-Pro☆140Jun 2, 2026Updated last week
- 🤖 Code for our EMNLP 2020 paper: "Will I Sound Like Me? Improving Persona Consistency in Dialogues through Pragmatic Self-Consciousness"☆37Oct 12, 2020Updated 5 years ago
- Code and Data for our EMNLP 2020 paper titled 'Learning to Explain: Datasets and Models for Identifying Valid Reasoning Chains in Multiho…☆28Feb 9, 2022Updated 4 years ago
- We construct and introduce DIALFACT, a testing benchmark dataset of crowd-annotated conversational claims, paired with pieces of evidence fr…☆42Jun 2, 2026Updated last week