☆21Apr 23, 2025Updated 11 months ago
Alternatives and similar repositories for PrivaCI-Bench
Users that are interested in PrivaCI-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code and dataset for the EMNLP 2024 paper: GoldCoin: Grounding Large Language Models in Privacy Laws via Contextual Integrity Theory☆48Sep 26, 2024Updated last year
- Code for ACL 2024 paper: PrivLM-Bench: A Multi-level Privacy Evaluation Benchmark for Language Models.☆16Feb 5, 2025Updated last year
- Code for Findings-EMNLP 2023 paper: Multi-step Jailbreaking Privacy Attacks on ChatGPT☆36Oct 15, 2023Updated 2 years ago
- A Swiss Army Knife for computational social choice research☆19Updated this week
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"☆11Jan 10, 2025Updated last year
- This repository contains the code for the EMNLP'23 paper "AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classificati…☆16Jun 3, 2024Updated last year
- Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs (EMNLP 2024)☆16Nov 17, 2024Updated last year
- NOTICE - this is deprecated. The Angular component router is under redesign and these samples are pre-RC1. - A small sample app for a bl…☆10Jan 15, 2016Updated 10 years ago
- ☆13May 10, 2025Updated 10 months ago
- Ranking-Consistent Language-Image Pretraining☆12Oct 24, 2025Updated 4 months ago
- A Benchmark for Evaluating Safety and Trustworthiness in Web Agents for Enterprise Scenarios☆21Mar 12, 2026Updated last week
- Let there be clock in the beach - WACV 2022☆15Nov 15, 2021Updated 4 years ago
- Using Vrep to simulate a six-legged robot to do motion planning & path planning☆10Jan 10, 2019Updated 7 years ago
- Codebase for "Surveilling Surveillance: Estimating the Prevalence of Surveillance Cameras with Street View Data"☆20Jul 15, 2021Updated 4 years ago
- Text generation using language models with multiple exit heads☆16Sep 18, 2025Updated 6 months ago
- Code implementation of R^2-Guard: Robust Reasoning Enabled LLM Guardrail via Knowledge-Enhanced Logical Reasoning☆22Jul 8, 2024Updated last year
- [NeurIPS 2025] Official Implementation for "Enhancing Vision-Language Model Reliability with Uncertainty-Guided Dropout Decoding"☆22Dec 8, 2024Updated last year
- A distributed rate limiting WSGI middleware.☆102Sep 20, 2013Updated 12 years ago
- ☆14Aug 7, 2025Updated 7 months ago
- established for the data normalization and reinforcement learning training scheme to train an agent in DCS world☆12Oct 22, 2021Updated 4 years ago
- Official repository for "On the Multi-modal Vulnerability of Diffusion Models"☆16Jul 15, 2024Updated last year
- 【入口,请看这里!】Bulletin of our awesome collections 📓📔📒📕📗📘📙📚📖🔖☆11Mar 13, 2018Updated 8 years ago
- [NeurIPS'25 Spotlight] MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation☆21Feb 23, 2025Updated last year
- reveal-md is great project. Improve your presentation even more with custom user scripts. Here is the place to find them.☆15Dec 7, 2023Updated 2 years ago
- An implementation of vdist2vec model in paper A Learning Based Approach to Predict Shortest-Path Distances☆11Apr 8, 2022Updated 3 years ago
- AIR-Bench 2024 is a safety benchmark that aligns with emerging government regulations and company policies☆29Aug 14, 2024Updated last year
- ☆12Feb 19, 2024Updated 2 years ago
- A benchmark that focuses on the sampling dilemma in long-video tasks. Through well-designed tasks, it evaluates the sampling efficiency o…☆27Aug 7, 2025Updated 7 months ago
- Run multiple commands in a docker container.☆37Oct 5, 2014Updated 11 years ago
- A Python library for guardrail models evaluation.☆34Oct 9, 2025Updated 5 months ago
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)☆26Feb 25, 2025Updated last year
- [ICLR'26] EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization☆29Aug 5, 2025Updated 7 months ago
- porting memtester to Android☆22Mar 14, 2021Updated 5 years ago
- ☆15May 22, 2024Updated last year
- Code and data repository for "The Mirage of Model Editing: Revisiting Evaluation in the Wild"☆16Aug 27, 2025Updated 6 months ago
- Explore VLM-Eval, a framework for evaluating Video Large Language Models, enhancing your video analysis with cutting-edge AI technology.☆37Jan 20, 2024Updated 2 years ago
- 免费的计算机编程类中文书籍,欢迎投稿☆15Dec 22, 2015Updated 10 years ago
- porting iozone to android☆27Feb 5, 2016Updated 10 years ago
- [NAACL 2024 Findings] Evaluation suite for the systematic evaluation of instruction selection methods.☆23Jul 26, 2023Updated 2 years ago