josh-ashkinaze / plurals
Plurals: A System for Guiding LLMs Via Simulated Social Ensembles
☆17 Updated this week
Alternatives and similar repositories for plurals:
Users who are interested in plurals compare it to the libraries listed below:
- ☆29 Updated last year
- Factored Cognition Primer: How to write compositional language model programs ☆48 Updated 2 years ago
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness. ☆26 Updated 7 months ago
- Finding semantically meaningful and accurate prompts. ☆46 Updated last year
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine ☆31 Updated 3 years ago
- LLM plugin for clustering embeddings ☆72 Updated last year
- The Prism Alignment Project ☆70 Updated 11 months ago
- ☆55 Updated 4 months ago
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper. ☆50 Updated 3 weeks ago
- Evaluation of neuro-symbolic engines ☆35 Updated 7 months ago
- Governance of the Commons Simulation (GovSim) ☆44 Updated 2 months ago
- ☆68 Updated last year
- Code for our NeurIPS'24 Dataset and Benchmark paper: Cooperation, Competition, and Maliciousness: LLM-Stakeholders Interactive Negotiatio… ☆26 Updated 4 months ago
- ☆89 Updated last month
- Functional Benchmarks and the Reasoning Gap ☆84 Updated 6 months ago
- General-Sum variant of the game Diplomacy for evaluating AIs. ☆28 Updated last year
- ☆131 Updated 5 months ago
- Evaluating the Moral Beliefs Encoded in LLMs ☆24 Updated 3 months ago
- ☆45 Updated last week
- MoodCat😼 classifies the mood of English sentences. ☆14 Updated 2 years ago
- ☆16 Updated 6 months ago
- Pre-train Static Word Embeddings ☆51 Updated 3 weeks ago
- Datasets from the paper "Towards Understanding Sycophancy in Language Models" ☆73 Updated last year
- Datasets and code from our paper, where we use machine learning to predict if ChatGPT will refuse a given prompt. ☆36 Updated last year
- ☆55 Updated this week
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs. ☆113 Updated 10 months ago
- Learning to route instances for Human vs AI Feedback ☆22 Updated last month
- TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graph ☆23 Updated last year
- ☆91 Updated 10 months ago
- ☆70 Updated 6 months ago