☆86Mar 20, 2025Updated 11 months ago
Alternatives and similar repositories for TrustAgent
Users that are interested in TrustAgent are comparing it to the libraries listed below
Sorting:
- ☆12Dec 22, 2025Updated 2 months ago
- Code for ICML 2022 paper: Achieving Fairness at No Utility Cost via Data Reweighing with Influence☆11Aug 3, 2022Updated 3 years ago
- ☆10Sep 25, 2024Updated last year
- Consuming Resrouce via Auto-generation for LLM-DoS Attack under Black-box Settings☆18Sep 1, 2025Updated 6 months ago
- [ICLR 2024] heterogeneous MoE: mixture of weak & strong experts on graphs https//openreview.net/pdf?id=wYvuY60SdD☆20Apr 6, 2025Updated 10 months ago
- Papers on concurrency vulnerability analysis, including multithreaded programs, multi-tasking programs and interrupt driven programs.☆15Nov 11, 2022Updated 3 years ago
- ☆38Oct 12, 2025Updated 4 months ago
- Code for Findings-EMNLP 2023 paper: Multi-step Jailbreaking Privacy Attacks on ChatGPT☆36Oct 15, 2023Updated 2 years ago
- ☆77Dec 19, 2024Updated last year
- TrustAgent: Towards Safe and Trustworthy LLM-based Agents☆56Feb 7, 2025Updated last year
- [CCS'22] SSLGuard: A Watermarking Scheme for Self-supervised Learning Pre-trained Encoders☆18Jul 12, 2022Updated 3 years ago
- ☆118Jul 2, 2024Updated last year
- ☆22Jul 25, 2024Updated last year
- [IEEE T-IFS] AutoPT: How Far Are We from the Fully Automated Web Penetration Testing?☆32Aug 18, 2025Updated 6 months ago
- Matching Natural Language Sentences with Hierarchical Sentence Factorization☆22Apr 26, 2018Updated 7 years ago
- Tongji select courses 同济抢课(捡漏)程序--适用于四轮选课☆20Jan 8, 2024Updated 2 years ago
- SIMON and SPECK, the two lightweight block ciphers designed by the researchers from NSA☆24Jul 12, 2013Updated 12 years ago
- An implementation of drophead regularization for pytorch transformers☆19Aug 24, 2021Updated 4 years ago
- LLM Platform Security: Applying a Systematic Evaluation Framework to OpenAI's ChatGPT Plugins☆29Jul 29, 2024Updated last year
- ☆72Mar 30, 2025Updated 11 months ago
- ☆28Aug 21, 2023Updated 2 years ago
- Code for WWW2019 paper "A Hierarchical Attention Retrieval Model for Healthcare Question Answering"☆22Jul 25, 2020Updated 5 years ago
- ☆34Aug 28, 2024Updated last year
- ☆37Oct 15, 2024Updated last year
- Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" on Findings of NAACL 2022☆32Jul 11, 2022Updated 3 years ago
- A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).☆1,870Feb 23, 2026Updated last week
- This is the official repository for the ICLR 2025 accepted paper Badrobot: Manipulating Embodied LLMs in the Physical World.☆41Jun 26, 2025Updated 8 months ago
- Enterprise AI Security Platform - Real-time firewall protection for LLM applications against prompt injection, data leakage, and function…☆23Sep 14, 2025Updated 5 months ago
- Improved techniques for optimization-based jailbreaking on large language models (ICLR2025)☆142Apr 7, 2025Updated 10 months ago
- A data construction and evaluation framework to quantify privacy norm awareness of language models (LMs) and emerging privacy risk of LM …☆43Mar 4, 2025Updated last year
- Clone of JSAI static analysis framework☆13Jul 29, 2017Updated 8 years ago
- ☆10Aug 9, 2023Updated 2 years ago
- Homework for STAT 205A - Berkeley☆13Dec 9, 2014Updated 11 years ago
- MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]☆23Dec 10, 2025Updated 2 months ago
- Implementation of followinf estimation algorithms in python: Kalman Filter, Extended Kalman Filter, Unscented Kalman Filter, Cubature Kal…☆11Dec 2, 2023Updated 2 years ago
- CTINexus is a framework that leverages optimized in-context learning of LLMs to enable data-efficient extraction of cyber threat intellig…☆70Feb 25, 2026Updated last week
- Enable Next-sentence Prediction for Large Language Models with Faster Speed, Higher Accuracy and Longer Context☆41Aug 16, 2024Updated last year
- ☆10Apr 30, 2024Updated last year
- EmotionCircuits-LLM: A complete, reproducible framework for discovering and controlling emotion circuits in large language models.☆25Oct 20, 2025Updated 4 months ago