chziakas / redevalLinks
A library for red-teaming LLM applications with LLMs.
☆28Updated last year
Alternatives and similar repositories for redeval
Users that are interested in redeval are comparing it to the libraries listed below
Sorting:
- Red-Teaming Language Models with DSPy☆249Updated 10 months ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆100Updated 8 months ago
- ☆26Updated last year
- Sphynx Hallucination Induction☆53Updated 11 months ago
- Code for the paper "Fishing for Magikarp"☆178Updated 7 months ago
- A prompt injection game to collect data for robust ML research☆65Updated 11 months ago
- ☆64Updated this week
- CodeSage: Code Representation Learning At Scale (ICLR 2024)☆115Updated last year
- ☆34Updated last year
- Papers about red teaming LLMs and Multimodal models.☆159Updated 7 months ago
- ☆29Updated 7 months ago
- Official repository for the paper "ALERT: A Comprehensive Benchmark for Assessing Large Language Models’ Safety through Red Teaming"☆53Updated last year
- CiteME is a benchmark designed to test the abilities of language models in finding papers that are cited in scientific texts.☆48Updated 2 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆111Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year
- The fastest Trust Layer for AI Agents☆146Updated 7 months ago
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".☆123Updated last year
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆66Updated last month
- ☆86Updated last year
- Open Source Replication of Anthropic's Alignment Faking Paper☆52Updated 9 months ago
- Track the progress of LLM context utilisation☆55Updated 8 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆65Updated last year
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…☆62Updated 3 months ago
- ☆55Updated last year
- ☆49Updated 9 months ago
- Measuring the situational awareness of language models☆39Updated last year
- Open Implementations of LLM Analyses☆107Updated last year
- Curation of prompts that are known to be adversarial to large language models☆188Updated 2 years ago
- Codebase accompanying the Summary of a Haystack paper.☆80Updated last year
- ☆38Updated 7 months ago