chen37058 / Physical-Attacks-in-Embodied-Navigation
The official implementation for "Towards Physically-Realizable Adversarial Attacks in Embodied Vision Navigation"
☆17 Updated 2 months ago
Alternatives and similar repositories for Physical-Attacks-in-Embodied-Navigation:
Users who are interested in Physical-Attacks-in-Embodied-Navigation are comparing it to the repositories listed below.
- [CCS 2024] "BadMerging: Backdoor Attacks Against Model Merging": official code implementation. ☆24 Updated 5 months ago
- [ICML 2024] Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models ☆115 Updated 2 months ago
- AAPL: Adding Attributes to Prompt Learning for Vision-Language Models (CVPRw 2024) ☆34 Updated 8 months ago
- PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning" ☆18 Updated 3 months ago
- [TMLR'24] This repository includes the official implementation of our paper "FedConv: Enhancing Convolutional Neural Networks for Handling D… ☆25 Updated 9 months ago
- Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models ☆72 Updated 4 months ago
- Official implementation of DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation which outperforms SOTA top 1-acc b… ☆18 Updated last month
- The official implementation of Preference Data Reward-Augmentation. ☆16 Updated 3 months ago
- Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025] ☆41 Updated last week
- [Arxiv 2024] Dissecting Adversarial Robustness of Multimodal LM Agents ☆54 Updated 2 weeks ago
- Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model ☆38 Updated 2 weeks ago
- [ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling ☆28 Updated 2 months ago
- [ICML 2024] Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast ☆93 Updated 10 months ago
- Official Implementation of KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models ☆38 Updated 3 months ago
- ☆38 Updated last month
- Multi-vision Sensor Perception and Reasoning (MS-PR) benchmark, assessing VLMs on their capacity for sensor-specific reasoning. ☆12 Updated 3 weeks ago
- [ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shi… ☆49 Updated 6 months ago
- The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models? ☆23 Updated 2 months ago
- Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformatio… ☆35 Updated this week
- ☆27 Updated last year
- Official Repository of Multi-Object Hallucination in Vision-Language Models (NeurIPS 2024) ☆26 Updated 2 months ago
- [NeurIPSw'24] This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simu… ☆77 Updated last week
- The code of RouterDC ☆46 Updated 2 weeks ago
- [NeurIPS 2024] A task generation and model evaluation system for multimodal language models. ☆62 Updated 2 months ago
- [IEEE VIS 2024] LLaVA-Chart: Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruc… ☆56 Updated last week
- [ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models ☆60 Updated 3 months ago
- [ECCV'24 Oral] The official GitHub page for ''Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking … ☆21 Updated 3 months ago
- TensorFlow code for our ECCV'24 Workshop paper "LightAvatar: Efficient Head Avatar as Dynamic NeLF" ☆27 Updated 2 months ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner" ☆62 Updated 3 months ago