[ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models
☆16Jun 18, 2025Updated 11 months ago
Alternatives and similar repositories for LongSafety
Users that are interested in LongSafety are comparing it to the libraries listed below.
- [ICLR 2026] BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs☆18May 21, 2025Updated last year
- Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks☆32Jul 9, 2024Updated last year
- [ACL 2024] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization☆29Jul 9, 2024Updated last year
- ☆19Oct 14, 2024Updated last year
- 🌿 Quickly generates a folder directory structure; supports configurable directory depth and output to a markdown file.☆13Oct 19, 2022Updated 3 years ago
- Focused Papers, Delivered Simply :)☆55Dec 25, 2025Updated 5 months ago
- Tsinghua University 2019 Computer Networking joint lab, Group 1☆28Jan 15, 2020Updated 6 years ago
- ☆22Jun 16, 2025Updated 11 months ago
- Tsinghua University Computer Graphics course project, Spring 2022 semester☆12Mar 4, 2023Updated 3 years ago
- Suri: Multi-constraint instruction following for long-form text generation [EMNLP’24]☆27Oct 3, 2025Updated 7 months ago
- ☆22Dec 9, 2023Updated 2 years ago
- Official code for PLoP☆20Mar 6, 2026Updated 2 months ago
- ☆32May 22, 2025Updated last year
- Math24o: High School Olympiad Mathematics Chinese Benchmark☆12Mar 27, 2025Updated last year
- Web version of the MiniDecaf compiler.☆13Sep 17, 2020Updated 5 years ago
- SkillRouter: Retrieve-and-Rerank Skill Selection for LLM Agents at Scale☆146May 7, 2026Updated 2 weeks ago
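- Seeyon OA: obtains the administrator cookie by sending a special request, then uploads a compressed webshell file through the file upload interface, and finally sends a decompression request to obtain the webshell☆10Apr 11, 2021Updated 5 years ago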
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs☆56Dec 7, 2025Updated 5 months ago
- Openreviewers: Multi Agent Academic Review Simulation System☆23Mar 2, 2024Updated 2 years ago
- Chinese-English bilingual dictionary, Python crawler☆15Oct 15, 2019Updated 6 years ago
- ☆15Feb 5, 2025Updated last year
- ☆56Mar 18, 2026Updated 2 months ago
- ☆26Nov 7, 2022Updated 3 years ago
- Log4j_dos_CVE-2021-45105☆13Dec 19, 2021Updated 4 years ago
- Official Code for NAACL 2022 paper: "Persona-Guided Planning for Controlling the Protagonist's Persona in Story Generation"☆16Sep 1, 2022Updated 3 years ago
- Resolves conflicts when nesting a recyclerview inside a viewpager inside a scrollview☆10Jun 22, 2018Updated 7 years ago
- ☆10Jun 13, 2020Updated 5 years ago
- Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!☆11Oct 16, 2024Updated last year
- Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.☆18Jan 14, 2025Updated last year
- KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality☆45May 19, 2026Updated last week
- POC of CVE-2025-7783☆32Oct 31, 2025Updated 6 months ago
- Codebase for multilingual neural machine translation☆13Nov 24, 2022Updated 3 years ago
- lightsmile's personal mini general-purpose crawler framework for scraping publicly available corpus data from the web.☆13Sep 30, 2020Updated 5 years ago
- The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models.☆14Dec 16, 2024Updated last year
- ☆15Jun 27, 2020Updated 5 years ago
- Official implementation of Visco-Attack (EMNLP 2025 Main). An open-source one-click reproduction script is also provided.☆30Apr 11, 2026Updated last month
- A showcase of pure Markdown documents☆35Nov 14, 2014Updated 11 years ago
- A game in QT☆10Apr 21, 2018Updated 8 years ago