NextWordDev / psychoevals
Repository for PsychoEvals - a framework for LLM security, psychoanalysis, and moderation.
☆15Updated last year
Related projects ⓘ
Alternatives and complementary repositories for psychoevals
- ☆36Updated 3 weeks ago
- ☆17Updated 4 months ago
- A set of utilities for running few-shot prompting experiments on large-language models☆113Updated last year
- A repository for the paper "Beliefs about AI influence human-AI interaction and can be manipulated to increase perceived trustworthiness,…☆14Updated last year
- APIBench is a benchmark for evaluating the performance of API recommendation approaches released in the paper "Revisiting, Benchmarking a…☆53Updated last year
- Code accompanying "How I learned to start worrying about prompt formatting".☆95Updated last month
- ☆50Updated 5 months ago
- Recent papers on (1) Psychology of LLMs; (2) Biases in LLMs.☆43Updated last year
- The official repository of the OpenToM dataset☆17Updated 8 months ago
- ☆81Updated 4 months ago
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆92Updated last year
- LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces☆78Updated last year
- Data and evaluation scripts for "CodePlan: Repository-level Coding using LLMs and Planning", FSE 2024☆52Updated 2 months ago
- ToK aka Tree of Knowledge for Large Language Models LLM. It's a novel dataset that inspires knowledge symbolic correlation in simple inpu…☆46Updated last year
- Gentopia Agent Zoo and Agent Benchmark☆28Updated last year
- Enhancing AI Software Engineering with Repository-level Code Graph☆96Updated 2 months ago
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆141Updated last year
- [NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind☆67Updated 11 months ago
- [ICLR'24 Spotlight] A language model (LM)-based emulation framework for identifying the risks of LM agents with tool use☆115Updated 8 months ago
- Source code for paper: INTERVENOR : Prompt the Coding Ability of Large Language Models with the Interactive Chain of Repairing☆24Updated 5 months ago
- Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding☆99Updated 4 months ago
- ☆86Updated 5 months ago
- Evaluating the Moral Beliefs Encoded in LLMs☆21Updated 9 months ago
- Large Language Models Meet NL2Code: A Survey☆34Updated this week
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref…☆23Updated last month
- RepoQA: Evaluating Long-Context Code Understanding☆100Updated 2 weeks ago
- Weak-to-Strong Jailbreaking on Large Language Models☆67Updated 9 months ago
- Analyzing and scoring reasoning traces of LLMs☆41Updated 2 months ago
- A collection of works that investigate social agents, simulations and their real-world impact in text, embodied, and robotics contexts.☆63Updated 5 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆80Updated 2 months ago