YFHuangxxxx / CBBQ
☆24 · Updated last year
Alternatives and similar repositories for CBBQ:
Users who are interested in CBBQ are comparing it to the repositories listed below.
- EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration ☆36 · Updated 11 months ago
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning" ☆57 · Updated 2 years ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models" ☆118 · Updated 8 months ago
- ☆38 · Updated last year
- [AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models ☆13 · Updated 2 months ago
- This repo is for the paper: On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark ☆24 · Updated 2 years ago
- ☆52 · Updated 6 months ago
- The source code of paper "CHEF: A Pilot Chinese Dataset for Evidence-Based Fact-Checking" ☆71 · Updated 2 years ago
- ☆72 · Updated 9 months ago
- [ICML'2024] Can AI Assistants Know What They Don't Know? ☆77 · Updated last year
- Repo for ACL2023 paper "Plug-and-Play Knowledge Injection for Pre-trained Language Models" ☆60 · Updated 10 months ago
- ☆11 · Updated last year
- ☆24 · Updated last year
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?" (AAAI 2024) ☆47 · Updated 10 months ago
- ☆60 · Updated last month
- ☆80 · Updated last year
- Collection of papers for scalable automated alignment. ☆82 · Updated 4 months ago
- This is the repo for our work “An Extensible Plug-and-Play Method for Multi-Aspect Controllable Text Generation” (ACL 2023). ☆12 · Updated last year
- OMGEval😮: An Open Multilingual Generative Evaluation Benchmark for Foundation Models ☆32 · Updated 7 months ago
- [ACL 2024] Unveiling Linguistic Regions in Large Language Models ☆29 · Updated 8 months ago
- EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud ☆22 · Updated 11 months ago
- Safety-J: Evaluating Safety with Critique ☆16 · Updated 6 months ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations" ☆63 · Updated 11 months ago
- ☆26 · Updated 6 months ago
- [ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling" ☆25 · Updated 10 months ago
- ☆24 · Updated 11 months ago
- self-adaptive in-context learning ☆44 · Updated last year
- Accompanying repo for the DP2O paper accepted by AAAI 2024 main conference ☆15 · Updated 10 months ago
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975 ☆37 · Updated last year
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track) ☆64 · Updated this week