☆57Mar 12, 2025Updated last year
Alternatives and similar repositories for output2prompt
Users that are interested in output2prompt are comparing it to the libraries listed below.
- The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'☆24May 20, 2025Updated 10 months ago
- ☆26Oct 27, 2025Updated 5 months ago
- ☆14Mar 9, 2025Updated last year
- Seminar 2022☆23Mar 19, 2026Updated 3 weeks ago
- Official implementation of Privacy Implications of Retrieval-Based Language Models (EMNLP 2023). https://arxiv.org/abs/2305.14888☆37Jun 10, 2024Updated last year
- Official repository for "Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks"☆62Aug 8, 2024Updated last year
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆61Mar 11, 2025Updated last year
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.☆18Dec 19, 2024Updated last year
- Code for the paper "xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking"☆18Apr 3, 2026Updated last week
- [arXiv'21] Additively Symmetric Homomorphic Encryption for Cross-Silo Federated Learning☆22Apr 28, 2025Updated 11 months ago
- Use angr for deobfuscation☆10Oct 8, 2019Updated 6 years ago
- Implementation of BEAST adversarial attack for language models (ICML 2024)☆89May 14, 2024Updated last year
- ☆20Jun 16, 2025Updated 9 months ago
- Official code for the ICCV 2023 paper "One-bit Flip is All You Need: When Bit-flip Attack Meets Model Training"☆20Aug 9, 2023Updated 2 years ago
- [ArXiv 2025] Denial-of-Service Poisoning Attacks on Large Language Models☆23Oct 22, 2024Updated last year
- Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups☆51Dec 23, 2024Updated last year
- Improving Alignment and Robustness with Circuit Breakers☆259Sep 24, 2024Updated last year
- Official implementation of AdvPrompter (https://arxiv.org/abs/2404.16873)☆181May 6, 2024Updated last year
- MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing…☆10Oct 7, 2024Updated last year
- Code for Voice Jailbreak Attacks Against GPT-4o.☆38May 31, 2024Updated last year
- Surgically de-slop LLMs☆14Jun 1, 2025Updated 10 months ago
- [ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"☆87Nov 28, 2023Updated 2 years ago
- A tool for visualization of complex job searches.☆13Jul 8, 2022Updated 3 years ago
- ☆11Aug 21, 2017Updated 8 years ago
- ☆27Jun 5, 2024Updated last year
- ☆70Feb 4, 2024Updated 2 years ago
- Learning to route instances for Human vs AI Feedback (ACL Main '25)☆28Jul 23, 2025Updated 8 months ago
- Official code for "On Calibrating Diffusion Probabilistic Models"☆30Feb 22, 2023Updated 3 years ago
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.☆120Mar 24, 2026Updated 2 weeks ago
- Data Valuation without Training of a Model, submitted to ICLR'23☆22Dec 30, 2022Updated 3 years ago
- ☆12Jan 5, 2023Updated 3 years ago
- A package that achieves 95%+ transfer attack success rate against GPT-4☆26Oct 24, 2024Updated last year
- ☆14Nov 7, 2022Updated 3 years ago
- Website for Artifact Evaluation at EuroSys, SOSP, OSDI, ATC☆51Updated this week
- ☆40May 19, 2023Updated 2 years ago
- ☆27Jul 18, 2025Updated 8 months ago
- TPC-E Benchmark☆13Feb 16, 2016Updated 10 years ago
- An Extension for oobabooga/text-generation-webui☆37Jul 15, 2023Updated 2 years ago
- Experiments with representation engineering☆14Feb 28, 2024Updated 2 years ago