TrustAgent: Towards Safe and Trustworthy LLM-based Agents
☆59 · Feb 7, 2025 · Updated last year
Alternatives and similar repositories for TrustAgent
Users that are interested in TrustAgent are comparing it to the libraries listed below.
- Code&Data for the paper "Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents" [NeurIPS 2024] ☆112 · Sep 27, 2024 · Updated last year
- [ACL 2025] The official code for "AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection". ☆41 · Aug 4, 2025 · Updated 10 months ago
- The official implementation of CVPR 2025 paper "Invisible Backdoor Attack against Self-supervised Learning" ☆18 · Jul 5, 2025 · Updated 11 months ago
- A production-grade implementation of an Investment Portfolio Management System created for testing LLM translation of real world legacy a… ☆26 · Oct 30, 2024 · Updated last year
- [VLM-Attack-Survey-2024] Paper list and projects for VLM attacks ☆18 · Feb 12, 2025 · Updated last year
- Knowledge Graph Large Language Model (KG-LLM) ☆40 · Jun 23, 2024 · Updated last year
- ☆24 · Dec 22, 2024 · Updated last year
- [USENIX Security 2022] Mitigating Membership Inference Attacks by Self-Distillation Through a Novel Ensemble Architecture ☆16 · Aug 29, 2022 · Updated 3 years ago
- A repository with several COBOL libraries [intended to be used with the latest COBOL standard], each containing a number of very useful fu… ☆12 · Feb 10, 2025 · Updated last year
- Repository for the Paper: Leave My Images Alone: Preventing Multi-Modal Large Language Models from Analyzing Images via Visual Prompt Inj… ☆19 · Apr 17, 2026 · Updated last month
- ☆56 · Dec 7, 2024 · Updated last year
- Official code for Guiding Language Model Math Reasoning with Planning Tokens ☆19 · Feb 29, 2024 · Updated 2 years ago
- Schoenfeld’s Anatomy of Mathematical Reasoning by Language Models ☆27 · Dec 21, 2025 · Updated 5 months ago
- Energetic Graph Neural Networks (EGNN) implementation based on Dirichlet Energy Constrained Learning. ☆27 · Nov 1, 2021 · Updated 4 years ago
- Concealed Data Poisoning Attacks on NLP Models ☆21 · Sep 4, 2023 · Updated 2 years ago
- A package that achieves 95%+ transfer attack success rate against GPT-4 ☆26 · Oct 24, 2024 · Updated last year
- [ICLR 2025] Dissecting adversarial robustness of multimodal language model agents ☆137 · Feb 19, 2025 · Updated last year
- [COLM 2024] LITE: Modeling Environmental Ecosystems with Multimodal Large Language Models ☆14 · Jan 4, 2025 · Updated last year
- The official implementation of USENIX Security'23 paper "Meta-Sift" -- Ten minutes or less to find a 1000-size or larger clean subset on … ☆20 · Apr 27, 2023 · Updated 3 years ago
- Official repository for CoTran: An LLM-based code translator for whole-program translation, fine-tuned using feedback from compiler and s… ☆15 · Nov 6, 2024 · Updated last year
- Multi-Agent LangGraph System ☆13 · May 22, 2025 · Updated last year
- CICS Process Management Application ☆12 · Jun 6, 2020 · Updated 6 years ago
- ☆19 · Mar 25, 2025 · Updated last year
- Repository for the NeurIPS 2024 paper "SearchLVLMs: A Plug-and-Play Framework for Augmenting Large Vision-Language Models by Searching Up… ☆26 · Dec 9, 2024 · Updated last year
- Code to the paper: The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence ☆32 · Jul 31, 2025 · Updated 10 months ago
- This is an agent (including contextual prompts) that queries your CSV ☆10 · Jun 8, 2023 · Updated 3 years ago
- List of T2I safety papers, updated daily, welcome to discuss using Discussions ☆68 · Aug 12, 2024 · Updated last year
- ☆78 · Jan 21, 2026 · Updated 4 months ago
- ☆16 · Nov 25, 2024 · Updated last year
- Code repository for the paper --- [USENIX Security 2023] Towards A Proactive ML Approach for Detecting Backdoor Poison Samples ☆30 · Jul 11, 2023 · Updated 2 years ago
- Backdoor Safety Tuning (NeurIPS 2023 & 2024 Spotlight) ☆27 · Nov 18, 2024 · Updated last year
- ☆10 · Jul 9, 2020 · Updated 5 years ago
- An official PyTorch implementation of "Certifiably Robust Graph Contrastive Learning" (NeurIPS 2023) ☆11 · Jan 22, 2024 · Updated 2 years ago
- Repository for the Paper (AAAI 2024, Oral) --- Visual Adversarial Examples Jailbreak Large Language Models ☆277 · May 13, 2024 · Updated 2 years ago
- ☆94 · Mar 20, 2025 · Updated last year
- Advanced Embodied Intelligence Brain Model ☆36 · Nov 5, 2025 · Updated 7 months ago
- Code for paper "Poisoned classifiers are not only backdoored, they are fundamentally broken" ☆26 · Jan 7, 2022 · Updated 4 years ago
- Code for the paper "Understanding and Evaluating Racial Biases in Image Captioning" ☆12 · Mar 26, 2026 · Updated 2 months ago
- Reimplementation of the paper `Human Attention Maps for Text Classification: Do Humans and Neural Networks Focus on the Same Words? (ACL2… ☆17 · Jul 10, 2020 · Updated 5 years ago