[ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models
☆16Jun 18, 2025Updated 9 months ago
Alternatives and similar repositories for LongSafety
Users that are interested in LongSafety are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2026] BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs☆18May 21, 2025Updated 10 months ago
- Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks☆32Jul 9, 2024Updated last year
- [ACL 2024] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization☆29Jul 9, 2024Updated last year
- ☆19Oct 14, 2024Updated last year
- 🌿快速生成文件夹目录结构,支持定义目录层级,支持生成到 markdown 文件。☆13Oct 19, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Focused Papers, Delivered Simply :)☆52Dec 25, 2025Updated 3 months ago
- 清华大学2019计网联合实验第一组☆28Jan 15, 2020Updated 6 years ago
- ☆20Jun 16, 2025Updated 9 months ago
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated last year
- 2022春季学期清华大学计算机图形学大作业☆12Mar 4, 2023Updated 3 years ago
- ☆30May 22, 2025Updated 10 months ago
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆27Oct 3, 2025Updated 5 months ago
- ☆22Dec 9, 2023Updated 2 years ago
- Official code for PLoP☆17Mar 6, 2026Updated 2 weeks ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Web version of the MiniDecaf compiler.☆13Sep 17, 2020Updated 5 years ago
- 致远OA通过发送特殊请求获取管理员cookie,再通过文件上传接口上传webshell压缩文件,最后发送解压请求获取webshell☆10Apr 11, 2021Updated 4 years ago
- Openreviewers: Multi Agent Academic Review Simulation System☆23Mar 2, 2024Updated 2 years ago
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs☆55Dec 7, 2025Updated 3 months ago
- 汉英双语词典,python crawler,chinese-english bilingual dictionary☆15Oct 15, 2019Updated 6 years ago
- ☆15Feb 5, 2025Updated last year
- ☆55Mar 18, 2026Updated last week
- ☆26Nov 7, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Log4j_dos_CVE-2021-45105☆13Dec 19, 2021Updated 4 years ago
- Official Code for NAACL 2022 paper: "Persona-Guided Planning for Controlling the Protagonist's Persona in Story Generation"☆16Sep 1, 2022Updated 3 years ago
- scrollview嵌套viewpager嵌套recyclerview冲突解决☆10Jun 22, 2018Updated 7 years ago
- ☆10Jun 13, 2020Updated 5 years ago
- all of tibetan dictionary.ཚོང་ལས་ལས་དོན་དུ་སྤྱོད་མི་ཆོག གལ་སྲིད་འགལ་ན་ཁྲིམས་རྩོད་བྱུང་ངེས།☆15Oct 15, 2023Updated 2 years ago
- Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!☆11Oct 16, 2024Updated last year
- Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.☆18Jan 14, 2025Updated last year
- POC of CVE-2025-7783☆31Oct 31, 2025Updated 4 months ago
- KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality☆40Dec 1, 2025Updated 3 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Codebase for multilingual neural machine translation☆13Nov 24, 2022Updated 3 years ago
- lightsmile个人的用于爬取网络公开语料数据的mini通用爬虫框架。☆13Sep 30, 2020Updated 5 years ago
- The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models.☆14Dec 16, 2024Updated last year
- A pure Markdown documents showcase☆35Nov 14, 2014Updated 11 years ago
- ☆15Jun 27, 2020Updated 5 years ago
- Official implementation of Visco-Attack (EMNLP 2025 Main). We will progressively release the code and one-click reproduction scripts.☆30Aug 22, 2025Updated 7 months ago
- Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning☆29Sep 12, 2025Updated 6 months ago