Tele-EVOL/TeleAI-Safety

☆26

Alternatives and similar repositories for TeleAI-Safety

Users who are interested in TeleAI-Safety are comparing it to the repositories listed below.

Tele-EVOL / AI-Governance
Global AI Safety and Governance: Never Compromise to Vulnerabilities
☆44 · Sep 11, 2025 · Updated 8 months ago
thinwayliu / Multimodal-Unlearnable-Examples
The code for ACM MM2024 (Multimodal Unlearnable Examples: Protecting Data against Multimodal Contrastive Learning)
☆15 · Jul 18, 2024 · Updated last year
HyeonjeongHa / MM-PoisonRAG
Official PyTorch implementation of "MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks"
☆15 · Dec 4, 2025 · Updated 5 months ago
SaFo-Lab / DoxBench
[ICLR 2026] The official code for "Doxing via the Lens: Revealing Location-related Privacy Leakage on Multi-modal Large Reasoning Models"
☆26 · Feb 7, 2026 · Updated 3 months ago
zzh-thu-22 / ExtendAttack
[AAAI 2026] This is the official implementation of the paper "ExtendAttack: Attacking Servers of LRMs via Extending Reasoning".
☆22 · Mar 18, 2026 · Updated 2 months ago
amazon-science / synthesizrr
Synthesizing realistic and diverse text-datasets from augmented LLMs
☆19 · Apr 4, 2026 · Updated last month
jiaxiaojunQAQ / FP-Better
Code for Fast Propagation is Better: Accelerating Single-Step Adversarial Training via Sampling Subnetworks (TIFS2024)
☆13 · Mar 29, 2024 · Updated 2 years ago
llltttppp / SS-NAN
Keras implementation of the paper Self-Supervised Neural Aggregation Networks for Human Parsing
☆23 · Sep 12, 2018 · Updated 7 years ago
UCSC-REAL / FLAT
[ICLR 2025] FLAT: LLM Unlearning via Loss Adjustment with Only Forget Data
☆14 · Feb 26, 2025 · Updated last year
Alibaba-AAIG / Oyster
The Oyster series is a set of safety models developed in-house by Alibaba-AAIG, devoted to building a responsible AI ecosystem. | Oyster …
☆62 · Apr 29, 2026 · Updated 3 weeks ago
Sadcardation / ImageProtector
Repository for the Paper: Leave My Images Alone: Preventing Multi-Modal Large Language Models from Analyzing Images via Visual Prompt Inj…
☆19 · Apr 17, 2026 · Updated last month
Institut-Polytechnique-de-Paris / time-disentanglement-lib
🤗 [ICLR 2024] Disentangling Time Series Representations via Contrastive based l-Variational Inference
☆18 · Dec 11, 2025 · Updated 5 months ago
SolidShen / BAIT
🔥🔥🔥 Detecting hidden backdoors in Large Language Models with only black-box access
☆56 · Jun 2, 2025 · Updated 11 months ago
thu-coai / AISafetyLab
AISafetyLab: A comprehensive framework covering safety attack, defense, evaluation and paper list.
☆241 · Apr 21, 2026 · Updated last month
INTREBID / Awesome-MM-RAG
This repository is for our survey paper: "A Comprehensive Survey on Multimodal RAG: All Combinations of Modalities as Input and Output"
☆49 · Nov 21, 2025 · Updated 6 months ago
Wangyuhao06 / IKEA
Implementation of the Implicit Knowledge Extraction Attack.
☆23 · Apr 17, 2026 · Updated last month
Xiuyuan-Chen / AutoEval-Video
☆37 · Jan 25, 2024 · Updated 2 years ago
collinzrj / adversarial_decoding
☆27 · Oct 27, 2025 · Updated 6 months ago
yangjunx21 / Paper-Pulse
Focused Papers, Delivered Simply :)
☆55 · Dec 25, 2025 · Updated 4 months ago
shashankprasanna / autogluon-demos
☆18 · Mar 31, 2020 · Updated 6 years ago
Yanie1asdfg / Quant-Lectures
☆14 · May 16, 2022 · Updated 4 years ago
SeRAlab / CNCA
CNCA: Toward Customizable and Natural Generation of Adversarial Camouflage for Vehicle Detectors
☆15 · Nov 3, 2024 · Updated last year
PositionalHidden / PositionalHidden
To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …
☆11 · Jun 18, 2024 · Updated last year
roywang021 / IDEATOR
Code for the ICCV 2025 paper "IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves"
☆17 · Jul 11, 2025 · Updated 10 months ago
shiningrain / JailGuard
☆29 · Mar 16, 2025 · Updated last year
XiaoYiWeio / deepsafe-scan
Universal preflight security scanner for AI coding agents: detects hook injection, credential exfiltration & backdoors in .cursorrules,…
☆70 · Apr 9, 2026 · Updated last month
Y-Xiang-hub / AdvEWM
This repository contains code for AdvEWM, as detailed in our paper published in JISA
☆18 · Mar 3, 2026 · Updated 2 months ago
TrustAIRLab / HateBench
[USENIX'25] HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns
☆14 · Mar 1, 2025 · Updated last year
xiangzejun / Optimizing_Implementations_of_Linear_Layers
A new heuristic to optimize implementations of linear matrices
☆20 · Jan 2, 2023 · Updated 3 years ago
thunxxx / MLLM-Jailbreak-evaluation-MMJ-Bench
☆78 · Mar 30, 2025 · Updated last year
aisa-group / skill-inject
Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks
☆72 · May 7, 2026 · Updated 2 weeks ago
Lpzhan931 / llm-debugger
An interactive attention visualization and intervention tool for the LLM decode stage.
☆48 · Jan 6, 2026 · Updated 4 months ago
S3IC-Lab / Odysseus
[NDSS 2026] Official repo for Odysseus: Jailbreaking Commercial Multimodal LLM-integrated Systems via Dual Steganography
☆36 · Mar 14, 2026 · Updated 2 months ago
AmourWaltz / UAlign
Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"
☆14 · Mar 25, 2025 · Updated last year
Zhang-Yihao / Adversarial-Representation-Engineering
Official implementation repository for the paper Towards General Conceptual Model Editing via Adversarial Representation Engineering.
☆20 · Dec 6, 2024 · Updated last year
jiaxiaojunQAQ / FGSM-PGI
Code for Prior-Guided Adversarial Initialization for Fast Adversarial Training (ECCV2022)
☆28 · Nov 25, 2022 · Updated 3 years ago
NY1024 / BAP-Jailbreak-Vision-Language-Models-via-Bi-Modal-Adversarial-Prompt
☆60 · Jun 5, 2024 · Updated last year
IAAR-Shanghai / SafeRAG
☆59 · Mar 11, 2025 · Updated last year
Social-AI-Studio / MemeCraft
Official repository for WWW'24 paper "MemeCraft: Contextual and Stance-Driven Multimodal Meme Generation"
☆12 · Jul 25, 2024 · Updated last year