LeonDiao0427 / SEASLinks
We release our code and data for SEAS in this repository.
☆20Updated last year
Alternatives and similar repositories for SEAS
Users that are interested in SEAS are comparing it to the libraries listed below
Sorting:
- ☆56Updated last year
- The reinforcement learning codes for dataset SPA-VL☆44Updated last year
- ☆38Updated last year
- ☆23Updated last year
- Official repository for "Safety in Large Reasoning Models: A Survey" - Exploring safety risks, attacks, and defenses for Large Reasoning …☆87Updated 5 months ago
- 【ACL 2024】 SALAD benchmark & MD-Judge☆170Updated 11 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆258Updated 6 months ago
- A comprehensive collection of process reward models.☆136Updated 4 months ago
- The awesome agents in the era of large language models☆71Updated 2 years ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆191Updated last year
- ☆174Updated 3 months ago
- ☆44Updated 7 months ago
- This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models.☆179Updated 7 months ago
- Multilingual safety benchmark for Large Language Models☆53Updated last year
- R-Judge: Benchmarking Safety Risk Awareness for LLM Agents (EMNLP Findings 2024)☆99Updated last month
- ☆20Updated 7 months ago
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.☆88Updated 11 months ago
- Awesome Large Reasoning Model(LRM) Safety.This repository is used to collect security-related research on large reasoning models such as …☆81Updated last week
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"☆15Updated 10 months ago
- [2025-TMLR] A Survey on the Honesty of Large Language Models☆64Updated last year
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…☆155Updated 5 months ago
- 🔍 Awesome Agentic Search is a curated list of papers, tools, and resources on agentic search—where AI agents plan, search, and reason to…☆53Updated 5 months ago
- [ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety☆53Updated 6 months ago
- RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024☆90Updated last year
- [NAACL2024] Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey☆109Updated last year
- ☆306Updated 7 months ago
- ☆70Updated 7 months ago
- S-Eval: Towards Automated and Comprehensive Safety Evaluation for Large Language Models☆109Updated 3 months ago
- ☆52Updated last year
- Accepted by ECCV 2024☆186Updated last year