TangciuYueng/AMemGuard

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TangciuYueng/AMemGuard)

TangciuYueng / AMemGuard

☆10

Alternatives and similar repositories for AMemGuard

Users that are interested in AMemGuard are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dsh3n77 / MINJA
View on GitHub
Memory Injection Attacks on LLM Agents via Query-Only Interaction
☆28Feb 10, 2026Updated 5 months ago
Wangtk311 / SafeEar-Inference-Test-Script
View on GitHub
SafeEar是由浙大和清华共同开发的一种深度伪声探测模型。这是我撰写的模型推理脚本。我不确定它是否正确，目前我还是初学者，如有问题请原谅我并指出，谢谢！
☆16May 16, 2025Updated last year
zzh-thu-22 / ExtendAttack
View on GitHub
[AAAI 2026] This is the official implementation of the paper "ExtendAttack: Attacking Servers of LRMs via Extending Reasoning".
☆25Mar 18, 2026Updated 3 months ago
chanchimin / AgentMonitor
View on GitHub
Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"
☆13Dec 13, 2024Updated last year
wangbo9719 / MEXTRA
View on GitHub
Source code for the ACL'2025 paper titled "Unveiling privacy risks in llm agent memory"
☆34Dec 2, 2025Updated 7 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
THU-KEG / DICE
View on GitHub
DICE: Detecting In-distribution Data Contamination with LLM's Internal State
☆12Sep 21, 2024Updated last year
ZZZhr-1 / Robust_GUI_Grounding
View on GitHub
On the Robustness of GUI Grounding Models Against Image Attacks
☆12Apr 8, 2025Updated last year
CosmosYi / ReasoningShield
View on GitHub
ReasoningShield: Safety Detection over Reasoning Traces of Large Reasoning Models
☆26Sep 27, 2025Updated 9 months ago
thunlp / SparsingLaw
View on GitHub
The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".
☆32Nov 12, 2024Updated last year
kaijiezhu11 / MELON
View on GitHub
[ICML'25] MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents
☆35Jul 31, 2025Updated 11 months ago
1229095296 / ResRL
View on GitHub
This repository includes code for our paper: ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning…
☆15May 2, 2026Updated 2 months ago
Astarojth / AgentAuditor-ASSEBench
View on GitHub
☆39May 29, 2026Updated last month
Simplified-Reasoning / TRM
View on GitHub
Code repository for the ICML 2026 Oral paper "Characterizing, Evaluating, and Optimizing Complex Reasoning".
☆17Jun 21, 2026Updated 3 weeks ago
xxiqiao / TROJail
View on GitHub
Official implementation of "TROJail: Trajectory-Level Optimization for Multi-Turn Large Language Model Jailbreaks with Process Rewards"
☆30Updated this week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
sunblaze-ucb / progent
View on GitHub
Progent: Securing AI Agents with Privilege Control
☆39May 14, 2026Updated 2 months ago
Bklight999 / world-knowledge
View on GitHub
a novel self-evolving paradigm, without task, reward, or complex workflow
☆36May 12, 2026Updated 2 months ago
UCSC-REAL / FLAT
View on GitHub
[ICLR 2025] FLAT: LLM Unlearning via Loss Adjustment with Only Forget Data
☆14Feb 26, 2025Updated last year
yyy01 / PAC
View on GitHub
The official implementation of the paper "Data Contamination Calibration for Black-box LLMs" (ACL 2024)
☆16May 21, 2024Updated 2 years ago
Ymm-cll / TrustAgent
View on GitHub
☆98Mar 20, 2025Updated last year
Tele-EVOL / TeleAI-Safety
View on GitHub
☆27Jan 5, 2026Updated 6 months ago
ZBox1005 / AgentForesight
View on GitHub
AgentForesight: Online Auditing for Early Failure Prediction in Multi-Agent Systems
☆15May 12, 2026Updated 2 months ago
SaFo-Lab / AGrail4Agent
View on GitHub
[ACL 2025] The official code for "AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection".
☆42Aug 4, 2025Updated 11 months ago
Kaminyou / deepspeech2-pytorch-adversarial-attack
View on GitHub
Adversarial attack against DeepSpeech2 ASR pytorch model
☆24Jan 15, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
FoundationAgents / AutoEnv
View on GitHub
Scaling Agentic Environments Automatically.
☆66Mar 26, 2026Updated 3 months ago
ZHUWEI-hub / GUARD
View on GitHub
[ACL 2026] Dissecting Failure Dynamics in Large Language Model Reasoning
☆17Apr 17, 2026Updated 2 months ago
Wangyuhao06 / IKEA
View on GitHub
Implement of Implicit Knowledge Extraction Attack.
☆24Apr 17, 2026Updated 2 months ago
collinzrj / adversarial_decoding
View on GitHub
☆29Oct 27, 2025Updated 8 months ago
boyi-liu / Awesome-Personalized-Federated-Learning
View on GitHub
Paper List for Personalized Federated Learning (PFL)
☆15Jul 23, 2024Updated last year
ejones313 / roben
View on GitHub
☆12Mar 7, 2021Updated 5 years ago
yangjunx21 / Paper-Pulse
View on GitHub
Focused Papers, Delivered Simply ：）
☆55Dec 25, 2025Updated 6 months ago
wangyu-ustc / Mem-alpha
View on GitHub
The official implementation of the paper "Mem-α: Learning Memory Construction via Reinforcement Learning"
☆216Dec 25, 2025Updated 6 months ago
Zhow01 / SkillAttack
View on GitHub
☆52May 19, 2026Updated last month
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
inclusionAI / TC-AE
View on GitHub
Official repo for "TC-AE: Unlocking Token Capacity for Deep Compression Autoencoders"
☆24Apr 9, 2026Updated 3 months ago
YapengTian / AV-Robustness-CVPR21
View on GitHub
Can audio-visual integration strengthen robustness under multimodal attacks?
☆30Mar 31, 2022Updated 4 years ago
MiroMindAI / MiroEval
View on GitHub
MiroEval: A benchmark and evaluation framework for deep research agents — 100 tasks (70 text, 30 multimodal) assessed across synthesis qu…
☆45Jul 6, 2026Updated last week
leost123456 / LLaVAShield
View on GitHub
[CVPR 2026] LLaVAShield: Safeguarding Multimodal Multi-Turn Dialogues in Vision-Language Models
☆16Jun 26, 2026Updated 2 weeks ago
LINs-lab / MASArena
View on GitHub
A comprehensive framework for benchmarking single and multi-agent systems across a wide range of tasks—evaluating performance, accuracy, …
☆38Jun 5, 2026Updated last month
ZhaozwTD / KPE
View on GitHub
Codes for ACL2023 paper: Knowledgeable Parameter Efficient Tuning Network for Commonsense Question Answering.
☆11Sep 23, 2023Updated 2 years ago
PanasonicConnect / rap
View on GitHub
RAP: Retrieval-Augmented Planning with Contextual Memory for Multimodal LLM Agents
☆25Aug 23, 2024Updated last year