The evaluation code for A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5
☆53Jan 18, 2026Updated 4 months ago
Alternatives and similar repositories for AI-safety-report
Users that are interested in AI-safety-report are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆28Feb 19, 2025Updated last year
- Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…☆63Mar 25, 2026Updated 2 months ago
- [ACL 2026 Findings] "Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning"☆62May 26, 2026Updated 2 weeks ago
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆57Mar 12, 2026Updated 3 months ago
- ☆12Mar 24, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR 2026] Official repo for "EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation"☆60Mar 13, 2026Updated 3 months ago
- [CVPR 2026 Oral, Best Paper Finalist] SeaCache: Spectral-Evolution-Aware Cache for Accelerating Diffusion Models☆68Jun 5, 2026Updated last week
- [ACL'26] EvoToken-DLM (Beyond Hard Masks: Progressive Token Evolution for Diffusion Language)☆48Apr 7, 2026Updated 2 months ago
- Security-native LLM system for AI-generated application security.☆253Jun 4, 2026Updated last week
- In-Context Reinforcement Learning for Tool Use in Large Language Models☆48Mar 26, 2026Updated 2 months ago
- Open Ended Medical Reinforcement Learning☆62Mar 15, 2026Updated 3 months ago
- Code and data for "An Accurate Unsupervised Method for Joint Entity Alignment and Dangling Entity Detection".☆15Mar 26, 2022Updated 4 years ago
- [CVPR 2026] Official code of "EmbodiedSplat: Online Feed-Forward Semantic 3DGS for Open-Vocabulary 3D Scene Understanding"☆92Jun 1, 2026Updated 2 weeks ago
- Source code for UP-Diff☆15Nov 26, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently☆39Feb 4, 2026Updated 4 months ago
- ☆37Jan 30, 2026Updated 4 months ago
- [ACM MM2023] Code Release of GCMA: Generative Cross-Modal Transferable Adversarial Attacks from Images to Videos☆12Mar 29, 2024Updated 2 years ago
- Introducing XSafeClaw: The Open-Source Agent Safety Platform from Fudan University☆154Updated this week
- ☆12Aug 12, 2024Updated last year
- ☆47May 15, 2026Updated last month
- [ICML 2026] The official implementation of paper "Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation…☆76May 25, 2026Updated 3 weeks ago
- https://openreview.net/forum?id=OC1o4_OI6Jw☆13May 27, 2022Updated 4 years ago
- [NeurIPS2024] BoostAdapter: Improving Test-Time Adaptation via Regional Bootstrapping☆20Feb 28, 2026Updated 3 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆21Jan 17, 2025Updated last year
- ☆78Apr 20, 2026Updated last month
- The code for the paper "LCM: Locally Constrained Compact Point Cloud Model for Masked Point Modeling" (NeurIPS'24).☆15Dec 25, 2024Updated last year
- [ACM MM 2024] ReToMe-VA: Recursive Token Merging for Video Diffusion-based Unrestricted Adversarial Attack☆14Dec 20, 2024Updated last year
- [CVPR 2024] Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transfomers☆16Oct 24, 2024Updated last year
- Measuring RAG solutions throughput and latency☆20Jul 23, 2024Updated last year
- ☆70Feb 6, 2026Updated 4 months ago
- The implementatin of our ICLR 2021 work: Targeted Attack against Deep Neural Networks via Flipping Limited Weight Bits☆19Jul 20, 2021Updated 4 years ago
- Green-VLA: Staged Vision-Language-Action Model for Generalist Robots☆134Mar 5, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Test for Graph Unlearning Benchmark☆19Jul 12, 2025Updated 11 months ago
- ECCV2024: Adversarial Prompt Tuning for Vision-Language Models☆31Mar 7, 2026Updated 3 months ago
- Adversarial Tokenization☆39Nov 21, 2025Updated 6 months ago
- Emoji Attack [ICML 2025]☆44Jul 15, 2025Updated 11 months ago
- [NeurIPS 2024] Lumen: a Large multimodal model with versatile vision-centric capabilities☆25Sep 27, 2024Updated last year
- [ICLR 2025] BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks☆31Nov 2, 2025Updated 7 months ago
- The code of paper: Fully Exploiting Every Real Sample: SuperPixel Sample Gradient Model Stealing (CVPR 2024))☆19Mar 12, 2024Updated 2 years ago