Amaodemao / BiasPainterLinks
basically all the things I used for this article
☆25Updated 9 months ago
Alternatives and similar repositories for BiasPainter
Users that are interested in BiasPainter are comparing it to the libraries listed below
Sorting:
- ☆35Updated 7 months ago
- ☆31Updated 7 months ago
- ☆40Updated 9 months ago
- MTTM: Metamorphic Testing for Textual Content Moderation Software☆32Updated 2 years ago
- Multilingual safety benchmark for Large Language Models☆52Updated last year
- Benchmarking LLMs' Psychological Portrayal☆124Updated 9 months ago
- Benchmarking LLMs' Emotional Alignment with Humans☆112Updated 8 months ago
- ☆35Updated last year
- The official GitHub repository of the paper "Recent advances in large langauge model benchmarks against data contamination: From static t…☆45Updated 3 weeks ago
- Code and Results of the Paper: On the Reliability of Psychological Scales on Large Language Models☆30Updated last year
- ☆57Updated last year
- Recent papers on (1) Psychology of LLMs; (2) Biases in LLMs.☆49Updated last year
- Benchmarking LLMs' Gaming Ability in Multi-Agent Environments☆88Updated 5 months ago
- [ICLR 2025] ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation☆123Updated 3 months ago
- [EMNLP 2024] ”ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models“☆20Updated last year
- [2025-TMLR] A Survey on the Honesty of Large Language Models☆59Updated 10 months ago
- [ICLR 2025] Pad: Personalized alignment of llms at decoding-time☆15Updated 6 months ago
- MMoE: Multimodal Mixture-of-Experts (EMNLP 2024)☆12Updated 10 months ago
- ☆26Updated 2 years ago
- Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and…☆51Updated 4 months ago
- Official repo for "AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability"☆33Updated last year
- BeHonest: Benchmarking Honesty in Large Language Models☆34Updated last year
- Code for ACL 2024 paper "Soft Self-Consistency Improves Language Model Agents"☆25Updated last year
- Code and data for the paper "Steering Conversational Large Language Models for Long Emotional Support Conversations" along with a UI to v…☆12Updated 5 months ago
- Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping☆54Updated 4 months ago
- Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning☆84Updated this week
- Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023)☆16Updated last year
- ☆27Updated 2 years ago
- This is the repository of our ACL 2024 paper "ESCoT: Towards Interpretable Emotional Support Dialogue Systems".☆31Updated 5 months ago
- The implement of ACL2024: "MAPO: Advancing Multilingual Reasoning through Multilingual Alignment-as-Preference Optimization"☆42Updated last year