XSafeAI/AI-safety-report

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/XSafeAI/AI-safety-report)

XSafeAI / AI-safety-report

The evaluation code for A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5

☆53

Alternatives and similar repositories for AI-safety-report

Users that are interested in AI-safety-report are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AI-safety-book / AI-safety-book.github.io
View on GitHub
☆28Feb 19, 2025Updated last year
MurrayTom / ToolSafe
View on GitHub
Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…
☆58Mar 25, 2026Updated 2 months ago
Inso-13 / ArtHOI
View on GitHub
[ArXiv 26] The official repository of "ArtHOI: Articulated Human-Object Interaction Synthesis by 4D Reconstruction from Video Priors".
☆36Mar 5, 2026Updated 2 months ago
InternRobotics / M3
View on GitHub
M³: Dense Matching Meets Multi-View Foundation Models for Monocular Gaussian Splatting SLAM
☆71Mar 18, 2026Updated 2 months ago
yuezhouhu / residual-context-diffusion
View on GitHub
Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.
☆56Mar 12, 2026Updated 2 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
HKU-MMLab / EVATok
View on GitHub
[CVPR 2026] Official repo for "EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation"
☆59Mar 13, 2026Updated 2 months ago
RU-System-Software-and-Security / NIC
View on GitHub
☆12Mar 24, 2023Updated 3 years ago
OpenHelix-Team / frappe
View on GitHub
Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment
☆50Mar 24, 2026Updated 2 months ago
aim-uofa / EvoTokenDLM
View on GitHub
[ACL'26] EvoToken-DLM (Beyond Hard Masks: Progressive Token Evolution for Diffusion Language)
☆48Apr 7, 2026Updated last month
mbzuai-oryx / MediX-R1
View on GitHub
Open Ended Medical Reinforcement Learning
☆55Mar 15, 2026Updated 2 months ago
0nandon / EmbodiedSplat
View on GitHub
[CVPR 2026] Official code of "EmbodiedSplat: Online Feed-Forward Semantic 3DGS for Open-Vocabulary 3D Scene Understanding"
☆86Updated this week
gimpong / CVPR25-Condenser
View on GitHub
The code for the paper "Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning" (CVPR'25).
☆15Sep 25, 2025Updated 8 months ago
zeyuwang-zju / UP-Diff
View on GitHub
Source code for UP-Diff
☆15Nov 26, 2024Updated last year
GAIR-NLP / daVinci-Agency
View on GitHub
daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently
☆39Feb 4, 2026Updated 3 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
BrianChen1120 / RL-AWB
View on GitHub
☆36Jan 30, 2026Updated 3 months ago
Go2Heart / OmniStream
View on GitHub
OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams
☆100Mar 15, 2026Updated 2 months ago
kay-ck / GCMA
View on GitHub
[ACM MM2023] Code Release of GCMA: Generative Cross-Modal Transferable Adversarial Attacks from Images to Videos
☆12Mar 29, 2024Updated 2 years ago
XSafeAI / XSafeClaw
View on GitHub
Introducing XSafeClaw: The Open-Source Agent Safety Platform from Fudan University
☆151May 15, 2026Updated last week
AweAI-Team / BeyondSWE
View on GitHub
☆44May 15, 2026Updated last week
xbmxb / StructureCharacterization4DD
View on GitHub
https://openreview.net/forum?id=OC1o4_OI6Jw
☆13May 27, 2022Updated 3 years ago
taolinzhang / BoostAdapter
View on GitHub
[NeurIPS2024] BoostAdapter: Improving Test-Time Adaptation via Regional Bootstrapping
☆19Feb 28, 2026Updated 2 months ago
wdrink / OpenTokenizer
View on GitHub
☆21Jan 17, 2025Updated last year
xinwong / AdvDetect
View on GitHub
Adversarial Examples Detection Benchmark
☆17Dec 6, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
zyh16143998882 / LCM
View on GitHub
The code for the paper "LCM: Locally Constrained Compact Point Cloud Model for Masked Point Modeling" (NeurIPS'24).
☆14Dec 25, 2024Updated last year
AI45Lab / OpenRT
View on GitHub
Open-source red teaming framework for MLLMs with 42+ attack methods
☆249Mar 25, 2026Updated 2 months ago
teqkilla / RubricHub
View on GitHub
TBD
☆57Mar 13, 2026Updated 2 months ago
zlh-thu / DPFL
View on GitHub
A Fine-grained Differentially Private Federated Learning against Leakage from Gradients
☆16Jan 18, 2023Updated 3 years ago
microsoft / InfoAgent
View on GitHub
☆69Feb 6, 2026Updated 3 months ago
20000yshust / SWARM
View on GitHub
[CVPR 2024] Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transfomers
☆16Oct 24, 2024Updated last year
SciPhi-AI / RAG-Performance
View on GitHub
Measuring RAG solutions throughput and latency
☆20Jul 23, 2024Updated last year
jiawangbai / TA-LBF
View on GitHub
The implementatin of our ICLR 2021 work: Targeted Attack against Deep Neural Networks via Flipping Limited Weight Bits
☆19Jul 20, 2021Updated 4 years ago
zeyuwang-zju / TIRDet
View on GitHub
[ACM MM 2023] Official code for "TIRDet: Mono-Modality Thermal InfraRed Object Detection Based on Prior Thermal-To-Visible Translation"
☆23Dec 3, 2025Updated 5 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
gimpong / AAAI25-S5VH
View on GitHub
The code for the paper "Efficient Self-Supervised Video Hashing with Selective State Spaces" (AAAI'25).
☆23Aug 2, 2025Updated 9 months ago
jiamingzhang94 / Adversarial-Prompt-Tuning
View on GitHub
ECCV2024: Adversarial Prompt Tuning for Vision-Language Models
☆31Mar 7, 2026Updated 2 months ago
RenatoGeh / advtok
View on GitHub
Adversarial Tokenization
☆39Nov 21, 2025Updated 6 months ago
zhipeng-wei / EmojiAttack
View on GitHub
Emoji Attack [ICML 2025]
☆44Jul 15, 2025Updated 10 months ago
SxJyJay / Lumen
View on GitHub
[NeurIPS 2024] Lumen: a Large multimodal model with versatile vision-centric capabilities
☆25Sep 27, 2024Updated last year
gimpong / WWW22-HCQ
View on GitHub
The code for the paper "Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval" (WWW'22, Oral).
☆17Mar 8, 2022Updated 4 years ago
Hank0626 / DDN
View on GitHub
Official implementation of "DDN: Dual-domain Dynamic Normalization for Non-stationary Time Series Forecasting" (NeurIPS 2024)
☆25Oct 28, 2024Updated last year