NeuralSentinel / SafeInferView external linksLinks
☆22Jan 14, 2025Updated last year
Alternatives and similar repositories for SafeInfer
Users that are interested in SafeInfer are comparing it to the libraries listed below
Sorting:
- ☆13Jan 14, 2025Updated last year
- Programming Club IIT Kanpur Summer Project☆10Apr 6, 2017Updated 8 years ago
- ☆11Nov 12, 2024Updated last year
- The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed …☆11Sep 27, 2024Updated last year
- The official implement of "Routing Experts: Learning to Route Dynamic Experts in Existing Multi-modal Large Language Models"☆17Mar 24, 2025Updated 10 months ago
- SeCap: Self-Calibrating and Adaptive Prompts for Cross-view Person Re-Identification in Aerial-Ground Networks (CVPR'25)☆19Jul 1, 2025Updated 7 months ago
- get the media stream from Dahua/Haikang IPC SDK, and demux the stream to vedio and audio ES☆12Nov 15, 2015Updated 10 years ago
- [ACL 2025 Main] Open-source toolkit for automatic evaluation of text-to-image generation task, including training & test datasets and a d…☆16Jul 5, 2025Updated 7 months ago
- Torch code for Visual Question Generation☆14Mar 30, 2019Updated 6 years ago
- "Visual Prompt Selection for In-Context Learning Segmentation Framework"☆14Dec 13, 2024Updated last year
- Cog wrapper for playgroundai/playground-v2.5-1024px-aesthetic☆17Nov 25, 2024Updated last year
- ☆13Feb 24, 2025Updated 11 months ago
- Instruction Following Eval☆15Jan 16, 2025Updated last year
- 中文原生多层次文生视频测评基准☆18Jul 8, 2024Updated last year
- I don't want to maintain this project, the code probably won't compile or run. Archived.☆13Feb 25, 2024Updated last year
- Visual Dialog☆16Aug 30, 2020Updated 5 years ago
- The Code for Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models☆17Oct 4, 2024Updated last year
- ☆18Nov 30, 2025Updated 2 months ago
- 这是一个基于OpenCompass的模型评测系统,该系统提供了前端页面UI以方便用户自助开展评测工作。☆24Aug 25, 2025Updated 5 months ago
- ☆15Nov 17, 2020Updated 5 years ago
- SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types☆24Nov 29, 2024Updated last year
- PULSE-EVAL☆23Jan 12, 2024Updated 2 years ago
- LLM evaluation.☆16Nov 7, 2023Updated 2 years ago
- The official repository of the paper "The Digital Cybersecurity Expert: How Far Have We Come?" presented in IEEE S&P 2025☆24May 21, 2025Updated 8 months ago
- ☆21Aug 19, 2024Updated last year
- the official repo for EMNLP 2024 (main) paper "EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimo…☆20Apr 9, 2025Updated 10 months ago
- ☆20Oct 21, 2022Updated 3 years ago
- Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation (NeurIPS 2023)☆22Oct 1, 2023Updated 2 years ago
- The rule-based evaluation subset and code implementation of Omni-MATH☆26Dec 23, 2024Updated last year
- VulnHeist is an Automated Penetration Testing Suite 🔖 that streamlines vulnerability scanning 🔍 and exploitation 💥 using Nmap 🌐 and …☆35Mar 22, 2025Updated 10 months ago
- ☆27Apr 18, 2025Updated 9 months ago
- This is the official repository for the "Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP" paper acce…☆25Apr 18, 2024Updated last year
- ☆26Jun 5, 2024Updated last year
- ☆38Jul 14, 2025Updated 6 months ago
- 本文提出了一个基于“文心一言”的中国LLMs的安全评估基准,其中包括8种典型的安全场景和6种指令攻击类型。此外,本文还提出了安全评估的框架和过程,利用手动编写和收集开源数据的测试Prompts,以及人工干预结合利用LLM强大的评估能力作为“共同评估者”。☆32Sep 1, 2023Updated 2 years ago
- DataSciBench: An LLM Agent Benchmark for Data Science☆50Jan 21, 2026Updated 3 weeks ago
- PyTorch implementation of "Debiased Visual Question Answering from Feature and Sample Perspectives" (NeurIPS 2021)☆27Oct 13, 2022Updated 3 years ago
- [ACL 2024 Main] NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Jou…☆33Jun 25, 2024Updated last year
- A Lightweight Visual Reasoning Benchmark for Evaluating Large Multimodal Models through Complex Diagrams in Coding Tasks☆14Feb 25, 2025Updated 11 months ago