aida-ugent / llm-ideology-analysisLinks
Public repository for the paper "Large Language Models Reflect the Ideology of their Creators"
☆27Updated 9 months ago
Alternatives and similar repositories for llm-ideology-analysis
Users that are interested in llm-ideology-analysis are comparing it to the libraries listed below
Sorting:
- ☆80Updated last year
- Resources for cultural NLP research☆106Updated last month
- Benchmarking Large Language Models☆100Updated 4 months ago
- Code for "From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Mod…☆40Updated last year
- ☆24Updated 2 years ago
- A curated list of research papers and resources on Cultural LLM.☆52Updated last year
- Repository for the Bias Benchmark for QA dataset.☆129Updated last year
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆223Updated 11 months ago
- Code and Resources for the paper, "Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries"☆19Updated last year
- Repository for research in the field of Responsible NLP at Meta.☆202Updated 5 months ago
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)☆13Updated last year
- ☆49Updated 11 months ago
- BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages☆40Updated 2 months ago
- The Prism Alignment Project☆84Updated last year
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆109Updated last year
- Code for the paper "Studying Large Language Model Behaviors Under Context-Memory Conflicts With Real Documentss"☆15Updated last year
- Attribute statements generated by LLMs to preceding tokens using attention weights.☆18Updated 6 months ago
- Public repository for SemEval 2023 - Task 10 - Explainable Detection of Online Sexism (EDOS)☆24Updated 2 years ago
- ☆35Updated 3 months ago
- Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention)☆62Updated 3 years ago
- Calculate perplexity on a text with pre-trained language models. Support MLM (eg. DeBERTa), recurrent LM (eg. GPT3), and encoder-decoder …☆162Updated 4 months ago
- Landing page for MIB: A Mechanistic Interpretability Benchmark☆21Updated 2 months ago
- ☆116Updated last year
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)☆49Updated 9 months ago
- Data set for LREC 2020 paper "I Feel Offended, Don't Be Abusive!"☆18Updated 2 years ago
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆155Updated 2 years ago
- Code for "Goodtriever: Toxicity Mitigation with Retrieval-augmented Language Models"☆23Updated last year
- [NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evalua…☆35Updated 2 years ago
- ☆34Updated last year
- ☆48Updated last year