haizelabs / dspy-redteam
Red-Teaming Language Models with DSPy
☆202 Updated 5 months ago
Alternatives and similar repositories for dspy-redteam
Users who are interested in dspy-redteam are comparing it to the libraries listed below.
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite. ☆92 Updated 3 months ago
- Inference-time scaling for LLMs-as-a-judge. ☆251 Updated this week
- ☆24 Updated 8 months ago
- Sphynx Hallucination Induction ☆53 Updated 5 months ago
- ⚖️ Awesome LLM Judges ⚖️ ☆107 Updated 2 months ago
- ☆121 Updated last month
- Collection of evals for Inspect AI ☆178 Updated this week
- ☆97 Updated 2 weeks ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆84 Updated 9 months ago
- Fiddler Auditor is a tool to evaluate language models. ☆183 Updated last year
- The fastest Trust Layer for AI Agents ☆138 Updated last month
- A better way of testing, inspecting, and analyzing AI Agent traces. ☆39 Updated last week
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training". ☆109 Updated last year
- Code for the paper "Fishing for Magikarp" ☆157 Updated 2 months ago
- Guardrails for secure and robust agent development ☆313 Updated last month
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research. ☆99 Updated 2 weeks ago
- Functional Benchmarks and the Reasoning Gap ☆88 Updated 9 months ago
- ☆306 Updated 3 weeks ago
- Open Source Replication of Anthropic's Alignment Faking Paper ☆44 Updated 3 months ago
- ☆71 Updated 8 months ago
- Attribute (or cite) statements generated by LLMs back to in-context information. ☆245 Updated 9 months ago
- ☆55 Updated 9 months ago
- Mixing Language Models with Self-Verification and Meta-Verification ☆106 Updated 7 months ago
- Just a bunch of benchmark logs for different LLMs ☆119 Updated 11 months ago
- ☆129 Updated 3 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆240 Updated 5 months ago
- TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle ☆285 Updated this week
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file. ☆173 Updated 4 months ago
- ☆134 Updated 3 months ago
- An open-source compliance-centered evaluation framework for Generative AI models ☆158 Updated last week