hwchase17 / adversarial-promptsLinks
Curation of prompts that are known to be adversarial to large language models
☆186Updated 2 years ago
Alternatives and similar repositories for adversarial-prompts
Users that are interested in adversarial-prompts are comparing it to the libraries listed below
Sorting:
- This shows the results from using a second, filter LLM that analyses prompts before sending them to GPT-Chat☆113Updated 2 years ago
- Red-Teaming Language Models with DSPy☆216Updated 7 months ago
- Test suite for LLM prompts☆55Updated last year
- PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to a…☆422Updated last year
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆96Updated 5 months ago
- A joint community effort to create one central leaderboard for LLMs.☆305Updated last year
- Meta-prompt: a simple self-improving language agent☆89Updated 2 years ago
- A codebase for "Language Models can Solve Computer Tasks"☆236Updated last year
- Sphynx Hallucination Induction☆53Updated 8 months ago
- Problem solving by engaging multiple AI agents in conversation with each other and the user.☆226Updated last year
- Collection of Tree of Thoughts prompting techniques I've found useful to start with, then stylize, then iterate☆92Updated last year
- [Corca / ML] Automatically solved Gandalf AI with LLM☆51Updated 2 years ago
- Masked Python SDK wrapper for OpenAI API. Use public LLM APIs securely.☆119Updated 2 years ago
- CLARA: Code Language Assistant & Repository Analyzer☆94Updated 2 years ago
- A simple wrapper for OpenAI to log input/outputs.☆106Updated 2 years ago
- Record and replay LLM interactions for langchain☆82Updated last year
- ☆176Updated 2 years ago
- Fiddler Auditor is a tool to evaluate language models.☆188Updated last year
- GPT-based Conversation Summarizer☆148Updated 2 years ago
- This project is an attempt to create a common metric to test LLM's for progress in eliminating hallucinations which is the most serious c…☆222Updated 2 years ago
- Security measure for agentic LLMs using a council of AIs moderted by a veto system. The council judges an agent's actions outputs based o…☆38Updated 2 years ago
- A set of utilities for running few-shot prompting experiments on large-language models☆123Updated last year
- Code for the website www.jailbreakchat.com☆106Updated 2 years ago
- Hosted embedding platform to discover, evaluate, and retrieve embeddings☆73Updated 2 years ago
- ☆34Updated 4 months ago
- Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!☆321Updated last year
- Official Implementation of InstructZero; the first framework to optimize bad prompts of ChatGPT(API LLMs) and finally obtain good prompts…☆197Updated last year
- ☆303Updated last year
- 🎸 Integrating AI plugins to LLMs☆229Updated 2 years ago
- ☆174Updated last year