salesforce / AuditNLG
AuditNLG: Auditing Generative AI Language Modeling for Trustworthiness
☆97 Updated this week
Alternatives and similar repositories for AuditNLG:
Users who are interested in AuditNLG are comparing it to the libraries listed below.
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers ☆124 Updated 10 months ago
- Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models" ☆54 Updated last year
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI ☆92 Updated last year
- ☆29 Updated 11 months ago
- SILO Language Models code repository ☆81 Updated 11 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering" ☆82 Updated 5 months ago
- ☆26 Updated 2 years ago
- Token-level Reference-free Hallucination Detection ☆93 Updated last year
- ☆97 Updated 2 years ago
- This project studies the performance and robustness of language models and task-adaptation methods. ☆142 Updated 8 months ago
- A framework for few-shot evaluation of autoregressive language models. ☆102 Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆45 Updated last year
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following ☆79 Updated 4 months ago
- ☆64 Updated 11 months ago
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond" ☆59 Updated this week
- Retrieval Augmented Generation Generalized Evaluation Dataset ☆51 Updated 2 months ago
- ☆45 Updated last year
- ☆45 Updated 2 months ago
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models" ☆108 Updated last year
- ☆64 Updated 2 years ago
- ☆55 Updated 2 years ago
- The LM Contamination Index is a manually created database of contamination evidences for LMs. ☆77 Updated 9 months ago
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners ☆113 Updated 4 months ago
- ☆49 Updated last year
- Detect hallucinated tokens for conditional sequence generation. ☆64 Updated 2 years ago
- Apps built using Inspired Cognition's Critique. ☆58 Updated last year
- A Human-LLM Collaborative Dataset for Generative Information-seeking with Attribution ☆30 Updated last year
- The official repository for Efficient Long-Text Understanding Using Short-Text Models (Ivgi et al., 2022) paper ☆68 Updated last year
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval ☆40 Updated 3 months ago
- Code for paper 'Data-Efficient FineTuning' ☆29 Updated last year