chziakas / redevalLinks
A library for red-teaming LLM applications with LLMs.
☆29Updated last year
Alternatives and similar repositories for redeval
Users that are interested in redeval are comparing it to the libraries listed below
Sorting:
- Red-Teaming Language Models with DSPy☆250Updated 11 months ago
- ☆29Updated 8 months ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆100Updated 9 months ago
- Sphynx Hallucination Induction☆53Updated last year
- ☆190Updated last month
- The fastest Trust Layer for AI Agents☆152Updated last week
- Official repository for the paper "ALERT: A Comprehensive Benchmark for Assessing Large Language Models’ Safety through Red Teaming"☆54Updated last year
- ☆34Updated last year
- Papers about red teaming LLMs and Multimodal models.☆160Updated 8 months ago
- ☆23Updated 2 years ago
- This project investigates the security of large language models by performing binary classification of a set of input prompts to discover…☆57Updated 2 years ago
- ☆26Updated last year
- ☆38Updated 8 months ago
- The repository contains generative AI analytics platform application code.☆28Updated 4 months ago
- A repository of Language Model Vulnerabilities and Exposures (LVEs).☆112Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆80Updated last year
- ☆56Updated last year
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆66Updated 2 months ago
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…☆63Updated 4 months ago
- A prompt injection game to collect data for robust ML research☆68Updated last year
- Code for reproducing our paper "Are Sparse Autoencoders Useful? A Case Study in Sparse Probing"☆31Updated 10 months ago
- Code for the paper "Fishing for Magikarp"☆180Updated 8 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year
- LLM security and privacy☆54Updated last year
- Measuring the situational awareness of language models☆40Updated last year
- Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs)☆25Updated last year
- Here Comes the AI Worm: Preventing the Propagation of Adversarial Self-Replicating Prompts Within GenAI Ecosystems☆222Updated 5 months ago
- Track the progress of LLM context utilisation☆55Updated 9 months ago
- ☆65Updated last week
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆35Updated 9 months ago