☆55May 22, 2025Updated last year
Alternatives and similar repositories for Preference-Leakage
Users that are interested in Preference-Leakage are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆21Mar 17, 2025Updated last year
- To Think or Not to Think: Exploring the Unthinking Vulnerability in Large Reasoning Models☆33May 21, 2025Updated last year
- ☆23Oct 10, 2025Updated 8 months ago
- personal settings for linux tools, including zsh, vim, tmux, pip.☆11Dec 2, 2019Updated 6 years ago
- Benchmarking Multi-Image Understanding in Vision and Language Models☆11Jul 29, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Working code repository for the paper "SQL-of-Thought: Multi-agentic Text-to-SQL with Guided Error Correction"☆35Dec 18, 2025Updated 5 months ago
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆32Nov 12, 2024Updated last year
- ☆27Jun 5, 2023Updated 3 years ago
- ☆37May 29, 2026Updated 2 weeks ago
- ☆556May 21, 2026Updated 3 weeks ago
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆24Aug 13, 2024Updated last year
- [TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study☆16Nov 22, 2024Updated last year
- ☆19Mar 23, 2025Updated last year
- This is the code repo for our paper "Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression".☆13Feb 27, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 🚀 First survey on Attention Sink in Transformers — 200+ papers on utilization, interpretation, and mitigation.☆128Jun 5, 2026Updated last week
- ☆11Dec 5, 2020Updated 5 years ago
- Code for the paper "Multitasking Framework for Unsupervised Simple Definition Generation" on ACL 2022.☆17Aug 17, 2022Updated 3 years ago
- DICE: Detecting In-distribution Data Contamination with LLM's Internal State☆12Sep 21, 2024Updated last year
- ☆18Jul 20, 2025Updated 10 months ago
- [NDSS 2026] Official repo for Odysseus: Jailbreaking Commercial Multimodal LLM-integrated Systems via Dual Steganography☆56Mar 14, 2026Updated 3 months ago
- This is the code repo for the paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards".