[FCS'24] LVLM Safety paper
☆19Jan 4, 2025Updated last year
Alternatives and similar repositories for LVLM-Safety
Users that are interested in LVLM-Safety are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…☆47May 11, 2025Updated last year
- The code for ICLR2025 paper "SLMRec: Empowering Small Language Models for Sequential Recommendation".☆51Jun 16, 2025Updated last year
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆19Feb 29, 2024Updated 2 years ago
- Demo code for the paper: One Thing to Fool them All: Generating Interpretable, Universal, and Physically-Realizable Adversarial Features☆12Nov 30, 2023Updated 2 years ago
- [COLM 2024] LITE: Modeling Environmental Ecosystems with Multimodal Large Language Models☆14Jan 4, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆112Jan 23, 2026Updated 4 months ago
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"☆39Oct 1, 2025Updated 8 months ago
- ☆19Mar 25, 2025Updated last year
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆86Nov 2, 2025Updated 7 months ago
- Code and dataset for the paper: "Can Editing LLMs Inject Harm?" [AAAI'26]☆21Dec 26, 2025Updated 5 months ago
- Demonstration Agents for AIOS☆19Dec 25, 2024Updated last year
- ☆22Oct 12, 2024Updated last year
- Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!☆11Oct 16, 2024Updated last year
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A novel approach to improve the safety of large language models, enabling them to transition effectively from unsafe to safe state.☆72May 22, 2025Updated last year
- [ACL 2025] The official code for "AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection".☆41Aug 4, 2025Updated 10 months ago
- Cerebrum: Agent SDK for AIOS☆145Apr 23, 2026Updated last month
- [ICML 2024] Safety Fine-Tuning at (Almost) No Cost: A Baseline for Vision Large Language Models.☆90Jan 19, 2025Updated last year
- ☆16Mar 22, 2025Updated last year
- ☆11Jan 19, 2025Updated last year
- A toolkit for testing and improving named entity recognition [ESEC/FSE'23]☆11Aug 31, 2023Updated 2 years ago
- Tasks for describing differences between text distributions.☆17Aug 9, 2024Updated last year
- Röttger et al. (2025): "MSTS: A Multimodal Safety Test Suite for Vision-Language Models"☆20Mar 31, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆13Sep 12, 2024Updated last year
- Collection of Reverse Engineering in Large Model☆36Jan 8, 2025Updated last year
- ☆31Feb 10, 2025Updated last year
- Official code of "The Automated but Risky Game: Modeling and Benchmarking Agent-to-Agent Negotiations and Transactions in Consumer Market…☆27Jun 9, 2026Updated last week
- Frontend for Talent, a talent acquisition web application☆10Jan 5, 2023Updated 3 years ago
- ☆13Apr 3, 2024Updated 2 years ago
- Code and data of the ACL 2021 paper "Turn the Combination Lock: Learnable Textual Backdoor Attacks via Word Substitution"☆16Jun 29, 2021Updated 4 years ago
- Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow 2.0. Fork is made to detect ice and ship in a test envi…☆12Nov 15, 2023Updated 2 years ago
- A large-scale dataset composed of high-quality synthetic images aimed at evaluating social biases in LVLMs☆16Apr 7, 2026Updated 2 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A terminal interface that understands natural language commands instead of complex syntax☆19Nov 16, 2024Updated last year
- Source code for ICLR 2023 spotlight paper "Hidden Markov Transformer for Simultaneous Machine Translation"☆24Dec 11, 2023Updated 2 years ago
- A peer-to-peer communication system. BIT 小学期软件开发实训。☆11Sep 7, 2018Updated 7 years ago
- implementation of paper "Large Language Models are In-Context Semantic Reasoners rather than Symbolic Reasoners"☆20Aug 17, 2023Updated 2 years ago
- ☆14Nov 22, 2024Updated last year
- [ICLR2025] Detecting Backdoor Samples in Contrastive Language Image Pretraining☆20Feb 26, 2025Updated last year
- ☆11Sep 16, 2024Updated last year