hwchase17 / adversarial-prompts
Curation of prompts that are known to be adversarial to large language models
☆179Updated 2 years ago
Alternatives and similar repositories for adversarial-prompts
Users that are interested in adversarial-prompts are comparing it to the libraries listed below
Sorting:
- Red-Teaming Language Models with DSPy☆192Updated 3 months ago
- PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to a…☆365Updated last year
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆91Updated last month
- This shows the results from using a second, filter LLM that analyses prompts before sending them to GPT-Chat☆112Updated 2 years ago
- Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!☆299Updated 7 months ago
- This project is an attempt to create a common metric to test LLM's for progress in eliminating hallucinations which is the most serious c…☆222Updated 2 years ago
- Camel-Coder: Collaborative task completion with multiple agents. Role-based prompts, intervention mechanism, and thoughtful suggestions☆33Updated last year
- Code for the website www.jailbreakchat.com☆89Updated last year
- Test suite for LLM prompts☆48Updated last year
- ☆131Updated 2 years ago
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".☆102Updated last year
- Interactive Composition Explorer: a debugger for compositional language model programs☆551Updated last month
- ☆100Updated 2 months ago
- automatically generate @openai plugins by specifying your API in markdown in smol-developer style☆120Updated last year
- A Toolkit for Creating and Deploying LangChain Apps☆168Updated 2 years ago
- A command-line interface to generate textual and conversational datasets with LLMs.☆298Updated last year
- Directly Connecting Python to LLMs via Strongly-Typed Functions, Dataclasses, Interfaces & Generic Types☆399Updated 2 months ago
- Recursive self-improvement☆56Updated last year
- Learning to Program with Natural Language☆6Updated last year
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898☆217Updated last year
- ☆269Updated 10 months ago
- GPT-based Conversation Summarizer☆148Updated 2 years ago
- ☆84Updated last year
- ☆163Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models☆120Updated last year
- Meta-prompt: a simple self-improving language agent☆88Updated 2 years ago
- ☆217Updated 2 years ago
- 🔓 The open-source autonomous agent LLM initiative 🔓☆91Updated last year
- A codebase for "Language Models can Solve Computer Tasks"☆234Updated last year
- ☆54Updated 7 months ago