OpenLMLab / Sniffer
☆25Updated last year
Alternatives and similar repositories for Sniffer:
Users that are interested in Sniffer are comparing it to the libraries listed below
- ☆53Updated 8 months ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆63Updated last year
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆59Updated last year
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model☆68Updated 2 years ago
- [ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors☆36Updated 3 months ago
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?"(AAAI2024)☆48Updated last year
- ☆74Updated 11 months ago
- Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge☆13Updated last year
- This repo is for the paper: On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark☆25Updated 2 years ago
- ☆42Updated last year
- [ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"☆25Updated last year
- ☆41Updated last year
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback☆39Updated last year
- BeHonest: Benchmarking Honesty in Large Language Models☆31Updated 8 months ago
- Do Large Language Models Know What They Don’t Know?☆94Updated 6 months ago
- Shadow Alignment: The Ease of Subverting Safely-Aligned Language Models☆28Updated last year
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆48Updated 10 months ago
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.☆23Updated 8 months ago
- Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation (EMNLP 2023)☆30Updated last year
- EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration☆35Updated last year
- ☆32Updated last year
- Code for our EMNLP-2023 paper: "Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks"☆24Updated last year
- EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud☆22Updated last year
- Evaluating the Ripple Effects of Knowledge Editing in Language Models☆55Updated last year
- [ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models☆98Updated 2 weeks ago
- self-adaptive in-context learning☆45Updated 2 years ago
- Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models☆23Updated 9 months ago
- 🩺 A collection of ChatGPT evaluation reports on various bechmarks.☆48Updated 2 years ago
- Lightweight tool to identify Data Contamination in LLMs evaluation☆50Updated last year
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning☆36Updated last year