tatsu-lab / opinions_qa
☆104Updated 10 months ago
Alternatives and similar repositories for opinions_qa:
Users that are interested in opinions_qa are comparing it to the libraries listed below
- ☆47Updated last year
- Repository for the Bias Benchmark for QA dataset.☆103Updated last year
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/☆21Updated 2 weeks ago
- Röttger et al. (NAACL 2024): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models"☆93Updated last month
- How do transformer LMs encode relations?☆46Updated last year
- ☆23Updated last year
- Inspecting and Editing Knowledge Representations in Language Models☆112Updated last year
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper☆77Updated 4 years ago
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.☆69Updated 2 weeks ago
- This repository contains data, code and models for contextual noncompliance.☆20Updated 8 months ago
- ☆22Updated last year
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".☆71Updated last year
- Data and code for the preprint "In-Context Learning with Long-Context Models: An In-Depth Exploration"☆32Updated 7 months ago
- ☆25Updated last year
- Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.☆138Updated 5 months ago
- ☆38Updated last year
- ☆44Updated 6 months ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆58Updated last year
- Code for "From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Mod…☆35Updated last year
- Code repository for the paper "Mission: Impossible Language Models."☆50Updated this week
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆58Updated 2 years ago
- ☆72Updated last year
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆52Updated 4 months ago
- ☆82Updated 7 months ago
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆73Updated last year
- The Prism Alignment Project☆69Updated 11 months ago
- Code and data accompanying the paper "TRUE: Re-evaluating Factual Consistency Evaluation".☆76Updated 2 weeks ago
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks☆41Updated 3 months ago
- LoFiT: Localized Fine-tuning on LLM Representations☆34Updated 2 months ago
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model☆66Updated 2 years ago