yxwan123 / BiasAskerLinks
☆39Updated 8 months ago
Alternatives and similar repositories for BiasAsker
Users that are interested in BiasAsker are comparing it to the libraries listed below
Sorting:
- MTTM: Metamorphic Testing for Textual Content Moderation Software☆32Updated 2 years ago
- basically all the things I used for this article☆25Updated 8 months ago
- Multilingual safety benchmark for Large Language Models☆52Updated last year
- ☆34Updated 6 months ago
- ☆31Updated 7 months ago
- Benchmarking LLMs' Psychological Portrayal☆123Updated 8 months ago
- Benchmarking LLMs' Emotional Alignment with Humans☆111Updated 7 months ago
- Recent papers on (1) Psychology of LLMs; (2) Biases in LLMs.☆49Updated last year
- [EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"☆19Updated 11 months ago
- Benchmarking LLMs' Gaming Ability in Multi-Agent Environments☆88Updated 4 months ago
- The official GitHub repository of the paper "Recent advances in large langauge model benchmarks against data contamination: From static t…☆45Updated last week
- ☆28Updated last year
- ☆110Updated 4 months ago
- Codes and datasets of the paper Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment☆105Updated last year
- Code and Results of the Paper: On the Reliability of Psychological Scales on Large Language Models☆30Updated last year
- ☆51Updated last year
- Official repository for ICML 2024 paper "On Prompt-Driven Safeguarding for Large Language Models"☆96Updated 4 months ago
- [2025-TMLR] A Survey on the Honesty of Large Language Models☆59Updated 9 months ago
- 【ACL 2024】 SALAD benchmark & MD-Judge☆161Updated 6 months ago
- R-Judge: Benchmarking Safety Risk Awareness for LLM Agents (EMNLP Findings 2024)☆88Updated 4 months ago
- ☆35Updated 11 months ago
- ☆75Updated last year
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…☆109Updated last month
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆137Updated last year
- An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).☆56Updated last year
- ☆47Updated last year
- RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024☆82Updated 11 months ago
- ☆21Updated last year
- ☆11Updated 2 years ago
- Official Implementation of RISE (Reinforcing Reasoning with Self-Verification)☆28Updated last month