[FCS'24] LVLM Safety paper
☆19Jan 4, 2025Updated last year
Alternatives and similar repositories for LVLM-Safety
Users that are interested in LVLM-Safety are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICONIP'24]Mingyu.Jin's final year project☆30Aug 23, 2024Updated last year
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…☆47May 11, 2025Updated 11 months ago
- VAEGAN, I Love u☆16Aug 15, 2023Updated 2 years ago
- From Commands to Prompts: LLM-based Semantic File System for AIOS☆49Mar 9, 2025Updated last year
- The code for ICLR2025 paper "SLMRec: Empowering Small Language Models for Sequential Recommendation".☆51Jun 16, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Demo code for the paper: One Thing to Fool them All: Generating Interpretable, Universal, and Physically-Realizable Adversarial Features☆12Nov 30, 2023Updated 2 years ago
- ☆13Jan 9, 2024Updated 2 years ago
- ☆20Mar 25, 2026Updated last month
- The official implementation of CVPR 2025 paper "Invisible Backdoor Attack against Self-supervised Learning"☆17Jul 5, 2025Updated 10 months ago
- [ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety"☆34Jun 23, 2025Updated 10 months ago
- ☆19Mar 25, 2025Updated last year
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆87Nov 2, 2025Updated 6 months ago
- [ACL 2024] CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion☆59Oct 1, 2025Updated 7 months ago
- ☆16Nov 25, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for EMNLP 2022 main conference paper "Information-Transport-based Policy for Simultaneous Translation"☆13Nov 3, 2022Updated 3 years ago
- Code and dataset for the paper: "Can Editing LLMs Inject Harm?" [AAAI'26]☆21Dec 26, 2025Updated 4 months ago
- ☆18Jun 3, 2024Updated last year
- ☆21Oct 12, 2024Updated last year
- Knowledge Graph Large Language Model (KG-LLM)☆39Jun 23, 2024Updated last year
- Official repo of M$^2$PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning☆28Mar 23, 2025Updated last year
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 7 months ago
- [WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews☆17Dec 14, 2025Updated 4 months ago
- [ACL 2025] The official code for "AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection".☆40Aug 4, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Cerebrum: Agent SDK for AIOS☆134Apr 23, 2026Updated 2 weeks ago
- [ICML 2024] Safety Fine-Tuning at (Almost) No Cost: A Baseline for Vision Large Language Models.☆87Jan 19, 2025Updated last year
- ☆18Apr 7, 2025Updated last year
- ☆11Jan 19, 2025Updated last year
- A toolkit for testing and improving named entity recognition [ESEC/FSE'23]☆11Aug 31, 2023Updated 2 years ago
- Source code of paper "Systematic Assessment of Factual Knowledge in Large Language Models" - EMNLP Findings 2023☆17Mar 17, 2026Updated last month
- Tasks for describing differences between text distributions.☆17Aug 9, 2024Updated last year
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks☆53Nov 30, 2024Updated last year
- ☆89Sep 11, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Röttger et al. (2025): "MSTS: A Multimodal Safety Test Suite for Vision-Language Models"☆18Mar 31, 2025Updated last year
- ☆13Sep 12, 2024Updated last year
- This repository provides a PyTorch implementation of "Fooling Neural Network Interpretations via Adversarial Model Manipulation". Our pap…☆23Dec 19, 2020Updated 5 years ago
- ☆17Feb 23, 2025Updated last year
- Collection of Reverse Engineering in Large Model☆36Jan 8, 2025Updated last year
- Official code of "The Automated but Risky Game: Modeling Agent-to-Agent Negotiations and Transactions in Consumer Markets"☆26Mar 24, 2026Updated last month
- Frontend for Talent, a talent acquisition web application☆10Jan 5, 2023Updated 3 years ago