hwchase17 / adversarial-promptsLinks
Curation of prompts that are known to be adversarial to large language models
☆184Updated 2 years ago
Alternatives and similar repositories for adversarial-prompts
Users that are interested in adversarial-prompts are comparing it to the libraries listed below
Sorting:
- This shows the results from using a second, filter LLM that analyses prompts before sending them to GPT-Chat☆113Updated 2 years ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆97Updated 6 months ago
- Test suite for LLM prompts☆55Updated last year
- Red-Teaming Language Models with DSPy☆235Updated 8 months ago
- PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to a…☆427Updated last year
- Meta-prompt: a simple self-improving language agent☆90Updated 2 years ago
- GPT-based Conversation Summarizer☆149Updated 2 years ago
- Camel-Coder: Collaborative task completion with multiple agents. Role-based prompts, intervention mechanism, and thoughtful suggestions☆33Updated 2 years ago
- LUI: Autonomous Collective Decision Making via Large Language Models☆105Updated 2 years ago
- 🔓 The open-source autonomous agent LLM initiative 🔓☆91Updated last year
- Fact-checking LLM outputs with self-ask☆304Updated 2 years ago
- CLARA: Code Language Assistant & Repository Analyzer☆94Updated 2 years ago
- LLM-powered autonomous agent with hierarchical task management☆52Updated 2 years ago
- Public repo for my book Symphony of Thought: Orchestrating Artificial Cognition☆112Updated 3 years ago
- A set of utilities for running few-shot prompting experiments on large-language models☆125Updated 2 years ago
- Collection of Tree of Thoughts prompting techniques I've found useful to start with, then stylize, then iterate☆91Updated 2 years ago
- A simple wrapper for OpenAI to log input/outputs.☆106Updated 2 years ago
- Security measure for agentic LLMs using a council of AIs moderted by a veto system. The council judges an agent's actions outputs based o…☆38Updated 2 years ago
- This project is an attempt to create a common metric to test LLM's for progress in eliminating hallucinations which is the most serious c…☆222Updated 2 years ago
- A codebase for "Language Models can Solve Computer Tasks"☆235Updated last year
- ☆215Updated 2 years ago
- Record and replay LLM interactions for langchain☆82Updated last year
- Sphynx Hallucination Induction☆53Updated 8 months ago
- ☆173Updated 2 years ago
- A repo built for the purpose of benchmarking the performance of agents, regardless of how they are set up and how they work.☆277Updated last year
- ☆175Updated last year
- ☆273Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆109Updated 10 months ago
- ☆132Updated 2 years ago
- Keeping language models honest by directly eliciting knowledge encoded in their activations.☆211Updated this week