LeonDiao0427 / SEAS
We release our code and data for SEAS in this repository.
☆19 Updated 3 months ago
Alternatives and similar repositories for SEAS:
Users who are interested in SEAS are comparing it to the repositories listed below:
- The reinforcement learning codes for dataset SPA-VL ☆31 Updated 9 months ago
- A Survey on the Honesty of Large Language Models ☆56 Updated 3 months ago
- ☆64 Updated 9 months ago
- ☆30 Updated last week
- The official code repository for PRMBench. ☆68 Updated last month
- ☆26 Updated 9 months ago
- FeatureAlignment = Alignment + Mechanistic Interpretability ☆28 Updated 3 weeks ago
- ☆43 Updated 5 months ago
- ☆23 Updated 5 months ago
- ☆32 Updated 6 months ago
- [Preprint] A Neural-Symbolic Self-Training Framework ☆104 Updated 8 months ago
- mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating ☆94 Updated last year
- ☆43 Updated 9 months ago
- [ICLR 2025] Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist ☆31 Updated 5 months ago
- This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models" ☆47 Updated 5 months ago
- Safety-J: Evaluating Safety with Critique ☆16 Updated 8 months ago
- 😎 curated list of awesome LMM hallucinations papers, methods & resources. ☆150 Updated last year
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations" ☆63 Updated last year
- BeHonest: Benchmarking Honesty in Large Language Models ☆31 Updated 7 months ago
- the official repo for EMNLP 2024 (main) paper "EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimo… ☆19 Updated 2 weeks ago
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee… ☆38 Updated 8 months ago
- An Easy-to-use Hallucination Detection Framework for LLMs. ☆58 Updated 11 months ago
- [EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences" ☆19 Updated 6 months ago
- ☆59 Updated 7 months ago
- ☆19 Updated 5 months ago
- ☆74 Updated last week
- ☆36 Updated 3 weeks ago
- ☆17 Updated 5 months ago
- Language Imbalance Driven Rewarding for Multilingual Self-improving ☆15 Updated 5 months ago
- Official repository for ICML 2024 paper "On Prompt-Driven Safeguarding for Large Language Models" ☆88 Updated 6 months ago