salesforce/AuditNLG

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/salesforce/AuditNLG)

salesforce / AuditNLG

AuditNLG: Auditing Generative AI Language Modeling for Trustworthiness

☆103

Alternatives and similar repositories for AuditNLG

Users that are interested in AuditNLG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

eagle705 / awesome-nlp-note
View on GitHub
A curated list of resources dedicated to NLP (paper, blogs, note and etc)
☆13Nov 30, 2019Updated 6 years ago
salesforce / dialog-flow-extraction
View on GitHub
☆15Jun 2, 2026Updated last month
vid-koci / KBCtransferlearning
View on GitHub
Code accompanying the paper "Knowledge Base Completion Meets Transfer Learning"
☆15Feb 21, 2024Updated 2 years ago
salesforce / Overture
View on GitHub
Library for soft prompt tuning
☆22Jun 25, 2026Updated 3 weeks ago
salesforce / TaiChi
View on GitHub
Open source library for few shot NLP
☆79Jun 25, 2026Updated 3 weeks ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
chorusai / brave
View on GitHub
Brave is a simple visualisation library for NLP information extraction, built on top of embedded BRAT.
☆15Dec 25, 2019Updated 6 years ago
anthonywchen / MOCHA
View on GitHub
Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".
☆16May 3, 2022Updated 4 years ago
dair-iitd / BossNet
View on GitHub
BossNet: Disentangling Language and Knowledge in Task Oriented Dialogs
☆16Dec 8, 2022Updated 3 years ago
ozyyshr / ShareGPT_investigation
View on GitHub
The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions (EMNLP 2023))
☆13Dec 21, 2023Updated 2 years ago
ko-nlp / moducorpus-sanitizer
View on GitHub
모두의 말뭉치 데이터를 분석에 편리한 형태로 변환하는 기능을 제공합니다.
☆11Mar 2, 2022Updated 4 years ago
uds-lsv / TOKEN-is-a-MASK
View on GitHub
Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"
☆14Aug 19, 2022Updated 3 years ago
eval4nlp / SharedTask2023
View on GitHub
☆11Jul 6, 2024Updated 2 years ago
nec-research / st_tau
View on GitHub
This repository contains code for the paper "Uncertainty Estimation and Calibration with Finite-State Probabilistic RNNs" (Wang, Lawrence…
☆17Mar 8, 2021Updated 5 years ago
monologg / ko_lm_dataformat
View on GitHub
A utility for storing and reading files for Korean LM training 💾
☆35Updated this week
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
salesforce / QGen
View on GitHub
☆33Jun 2, 2026Updated last month
nickyringland / nested_named_entities
View on GitHub
☆62Aug 23, 2023Updated 2 years ago
minnesotanlp / cobbler
View on GitHub
Code and data for Koo et al's ACL 2024 paper "Benchmarking Cognitive Biases in Large Language Models as Evaluators"
☆23Feb 16, 2024Updated 2 years ago
ptarau / DeepRank
View on GitHub
A first cut into exploring the use of dependency links for building Text Graphs, that, among other things, with help of a centrality algo…
☆32Oct 20, 2023Updated 2 years ago
kernelmachine / silo-lm
View on GitHub
SILO Language Models code repository
☆83Feb 23, 2024Updated 2 years ago
uds-lsv / NoisyNER
View on GitHub
A dataset for realistic evaluation of noisy label methods
☆15Dec 3, 2023Updated 2 years ago
ZhangShiyue / extractive_is_not_faithful
View on GitHub
☆17May 19, 2023Updated 3 years ago
DFKI-NLP / REval
View on GitHub
[ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction
☆13Apr 21, 2020Updated 6 years ago
microsoft / HaDes
View on GitHub
Token-level Reference-free Hallucination Detection
☆97Jul 25, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
vikas95 / AIR-retriever
View on GitHub
AIR retriever for Multi-Hop QA (ACL 2020 paper)
☆30Jul 18, 2020Updated 6 years ago
wikistat / Fair-ML-4-Ethical-AI
View on GitHub
Fair Statistical Learning Algorithms for Ethical Artificial Intelligence
☆27Apr 5, 2023Updated 3 years ago
qqaatw / pytorch-realm-orqa
View on GitHub
PyTorch reimplementation of REALM and ORQA
☆22Feb 3, 2022Updated 4 years ago
bnewm0609 / arxivDIGESTables
View on GitHub
☆18Sep 15, 2025Updated 10 months ago
oriram / spider
View on GitHub
☆55Jan 18, 2023Updated 3 years ago
salesforce / factCC
View on GitHub
Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper
☆305May 1, 2025Updated last year
facebookresearch / lss_eval
View on GitHub
This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…
☆31Aug 25, 2023Updated 2 years ago
carolinlawrence / BiSon
View on GitHub
Code for bidirectional sequence generation (BiSon) for generating from BERT pre-trained models.
☆51Mar 17, 2020Updated 6 years ago
wzhouad / context-faithful-llm
View on GitHub
Code and data for paper "Context-faithful Prompting for Large Language Models".
☆41Mar 23, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
DSBA-Lab / CodeLab
View on GitHub
DSBA code study
☆30Nov 7, 2023Updated 2 years ago
salesforce / DialFact
View on GitHub
We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr…
☆41Jun 2, 2026Updated last month
google-research-datasets / answer-equivalence-dataset
View on GitHub
This dataset contains human judgements about answer equivalence. The data is based on SQuAD (Stanford Question Answering Dataset), and co…
☆30Oct 24, 2022Updated 3 years ago
skywalker023 / pragmatic-consistency
View on GitHub
🤖 Code for our EMNLP 2020 paper: "Will I Sound Like Me? Improving Persona Consistency in Dialogues through Pragmatic Self-Consciousness"
☆37Oct 12, 2020Updated 5 years ago
vid-koci / bert-commonsense
View on GitHub
Code for papers "A Surprisingly Robust Trick for Winograd Schema Challenge" and "WikiCREM: A Large Unsupervised Corpus for Coreference Re…
☆71Oct 4, 2022Updated 3 years ago
DavidMChan / grazier
View on GitHub
A tool for calling (and calling out to) large language models.
☆16Aug 13, 2024Updated last year
jderiu / spot-the-bot-code
View on GitHub
☆13Mar 1, 2022Updated 4 years ago