SALT-NLP / normbank
Data and code for the paper "NormBank: A Knowledge Bank of Situational Social Norms"
☆32 Updated 2 years ago
Alternatives and similar repositories for normbank
Users who are interested in normbank are comparing it to the repositories listed below.
- Code for the paper "CoS: Enhancing Personalization and Mitigating Bias with Context Steering" ☆18 Updated 11 months ago
- This repo contains code for our NeurIPS 2023 spotlight paper: Evaluating and Inducing Personality in Pre-trained Language Models ☆56 Updated 2 years ago
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight) ☆264 Updated last week
- ☆116 Updated last year
- ☆35 Updated 2 years ago
- Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award) ☆130 Updated 4 months ago
- Machine Theory of Mind Reading List. Built upon EMNLP Findings 2023 Paper: Towards A Holistic Landscape of Situated Theory of Mind in Lar… ☆147 Updated 9 months ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024) ☆78 Updated last year
- Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging ☆111 Updated 2 years ago
- Steering Llama 2 with Contrastive Activation Addition ☆195 Updated last year
- ☆89 Updated 11 months ago
- LLM Agora, debating between open-source LLMs to refine the answers ☆82 Updated 2 years ago
- Code and data for Marked Personas (ACL 2023) ☆28 Updated 2 years ago
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering ☆67 Updated last year
- Modular Pluralism @ EMNLP 2024 ☆21 Updated last year
- Repository for the Bias Benchmark for QA dataset. ☆132 Updated last year
- Performant framework for training, analyzing and visualizing Sparse Autoencoders (SAEs) and their frontier variants. ☆167 Updated this week
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts" ☆78 Updated last year
- ☆180 Updated last year
- Inspecting and Editing Knowledge Representations in Language Models ☆119 Updated 2 years ago
- Resources for cultural NLP research ☆109 Updated 2 months ago
- Function Vectors in Large Language Models (ICLR 2024) ☆186 Updated 7 months ago
- ☆47 Updated 2 months ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering ☆193 Updated 9 months ago
- ☆57 Updated 2 years ago
- LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces ☆100 Updated 2 years ago
- Recent papers on (1) Psychology of LLMs; (2) Biases in LLMs. ☆50 Updated 2 years ago
- FeatureAlignment = Alignment + Mechanistic Interpretability ☆33 Updated 9 months ago
- The Prism Alignment Project ☆86 Updated last year
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024) ☆123 Updated last year