pillowsofwind / LLM-CBRN-Risks
[ACL 2025 Findings] The official GitHub repo for the paper "Nuclear Deployed: Analyzing Catastrophic Risks in Decision-making of Autonomous LLM Agents"
☆18Updated last month
Alternatives and similar repositories for LLM-CBRN-Risks
Users who are interested in LLM-CBRN-Risks are comparing it to the libraries listed below
- [NeurIPS 2024] GuardT2I: Defending Text-to-Image Models from Adversarial Prompts☆53Updated 3 weeks ago
- This repo collects research papers that use AI tools and are in the field of scientific research (including computer science, agronomy, c…☆95Updated 3 months ago
- This is the public code repository of paper 'Comprehensive Assessment of Jailbreak Attacks Against LLMs'☆86Updated 9 months ago
- DPO-Shift: Shifting the Distribution of Direct Preference Optimization☆59Updated 3 months ago
- An easily extensible web template tool for labeling and annotation (Flask + Vue 3) [easily extensible annotation web page template]☆24Updated 2 months ago
- Official implementation of CIKM2024 paper titled "PROSPECT: Learn MLPs on Graphs Robust against Adversarial Structure Attacks"☆22Updated 4 months ago
- LLM-FuzzX is a user-friendly fuzz testing tool for Large Language Models (e.g., GPT, Claude, LLaMA), featuring advanced task-aware mutati…☆112Updated last month
- kight is a static analysis tool for C/C++ programs.☆216Updated 6 months ago
- Code and dataset of ARMOUR: zero-permission sensor usage (ACM WiSec 2025)☆35Updated 2 weeks ago
- Official Repository for Paper: The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning☆50Updated 2 months ago
- ☆101Updated 3 weeks ago
- Tensor-Var: Efficient four-dimensional variational data assimilation☆31Updated 4 months ago
- Deep Seek AI-Driven Strategies, Blockchain-Verified Trust☆42Updated 4 months ago
- RAS: Retrieval-And-Structuring for Knowledge-Intensive LLM Generation☆49Updated last month
- HeFlwr: Federated Learning for Heterogeneous Devices☆118Updated 3 months ago
- Industrial-grade weather visualization system that transforms AI model predictions into professional meteorological plots, emphasizing op…☆27Updated 5 months ago
- ☆23Updated 9 months ago
- [ACL 2025] FinMME: Benchmark Dataset for Financial Multi-Modal Reasoning Evaluation☆27Updated last week
- [MM 2024] Official code for VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness☆49Updated 11 months ago
- EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in chall…☆120Updated last week
- AI-powered document summarization engine that transforms lengthy texts into crystallized insights☆147Updated 7 months ago
- ☆19Updated 6 months ago
- ☆10Updated last year
- AI phone agents for business.☆17Updated 4 months ago
- ☆179Updated 4 months ago
- A toolkit that automatically deletes old Docker images from an AWS ECR repository, keeping only the latest N images.☆52Updated 4 months ago
- Paper reading assistant☆55Updated 5 months ago
- ☆141Updated last year
- Source Evaluation scripts for Humanity's Last Code Exam☆39Updated last week
- ☆163Updated last week