NextWordDev / psychoevalsLinks
Repository for PsychoEvals - a framework for LLM security, psychoanalysis, and moderation.
☆17Updated 2 years ago
Alternatives and similar repositories for psychoevals
Users that are interested in psychoevals are comparing it to the libraries listed below
Sorting:
- Awesome deliberative prompting: How to ask LLMs to produce reliable reasoning and make reason-responsive decisions.☆120Updated 10 months ago
- Analyzing and scoring reasoning traces of LLMs☆47Updated last year
- An active inference model of Lacanian psychoanalysis☆14Updated 6 months ago
- A set of utilities for running few-shot prompting experiments on large-language models☆126Updated 2 years ago
- Large Language Models Meet NL2Code: A Survey☆35Updated last year
- ☆100Updated last year
- ☆29Updated 2 years ago
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023☆251Updated 2 years ago
- PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to a…☆443Updated last year
- repo for the paper titled “CodeGen4Libs: A Two-Stage Approach for Library-Oriented Code Generation”☆14Updated 2 years ago
- Official repo for GPTFUZZER : Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts☆556Updated last year
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898☆232Updated last year
- Official repo for Customized but Compromised: Assessing Prompt Injection Risks in User-Designed GPTs☆29Updated 2 years ago
- The official implementation of our NAACL 2024 paper "A Wolf in Sheep’s Clothing: Generalized Nested Jailbreak Prompts can Fool Large Lang…☆149Updated 3 months ago
- APIBench is a benchmark for evaluating the performance of API recommendation approaches released in the paper "Revisiting, Benchmarking a…☆64Updated 2 years ago
- Meta-prompt: a simple self-improving language agent☆90Updated 2 years ago
- Test LLMs against jailbreaks and unprecedented harms☆36Updated last year
- distilled Self-Critique refines the outputs of a LLM with only synthetic data☆11Updated last year
- ☆68Updated last year
- ☆53Updated 9 months ago
- The Synthetic-Persona-Chat dataset is a synthetically generated persona-based dialogue dataset. It extends the original Persona-Chat data…☆105Updated last year
- Whispers in the Machine: Confidentiality in Agentic Systems☆41Updated 2 weeks ago
- Source code for the paper "Active Prompting with Chain-of-Thought for Large Language Models"☆248Updated last year
- Curation of prompts that are known to be adversarial to large language models☆186Updated 2 years ago
- Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!☆341Updated 2 months ago
- LLM red teaming datasets from the paper 'Student-Teacher Prompting for Red Teaming to Improve Guardrails' for the ART of Safety Workshop …☆18Updated 2 years ago
- LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces☆100Updated 2 years ago
- A collection of works that investigate social agents, simulations and their real-world impact in text, embodied, and robotics contexts.☆104Updated last year
- Code accompanying "How I learned to start worrying about prompt formatting".☆113Updated 6 months ago
- awesome-LLM-controlled-constrained-generation☆54Updated last year