allenai / everyday-thingsLinks
β17Updated 2 years ago
Alternatives and similar repositories for everyday-things
Users that are interested in everyday-things are comparing it to the libraries listed below
Sorting:
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkIβ95Updated 2 years ago
- π» Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"β57Updated last year
- SILO Language Models code repositoryβ83Updated last year
- Byte-sized text games for code generation tasks on virtual environmentsβ20Updated last year
- A unified benchmark for math reasoningβ89Updated 2 years ago
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgmentβ38Updated 2 years ago
- DialOp: Decision-oriented dialogue environments for collaborative language agentsβ111Updated last year
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"β60Updated 10 months ago
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language modβ¦β14Updated 2 years ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learningβ33Updated 11 months ago
- β38Updated 2 months ago
- Apps built using Inspired Cognition's Critique.β57Updated 2 years ago
- Language Models of Code are Few-Shot Commonsense Learners (EMNLP 2022)β86Updated 2 years ago
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learnersβ116Updated 5 months ago
- β44Updated last year
- β56Updated 2 years ago
- Neural models of common sense. π€β98Updated 2 years ago
- Supporting code for ReCEval paperβ30Updated last year
- Pretraining Efficiently on S2ORC!β175Updated last year
- β69Updated last year
- β141Updated 3 years ago
- β36Updated 3 years ago
- β72Updated 2 years ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"β66Updated last year
- [NAACL 2024] Struc-Bench: Are Large Language Models Good at Generating Complex Structured Tabular Data? https://aclanthology.org/2024.naaβ¦β55Updated 4 months ago
- A library for finding knowledge neurons in pretrained transformer models.β158Updated 3 years ago
- β32Updated 4 years ago
- β24Updated last year
- Code for our EMNLP '22 paper "Fixing Model Bugs with Natural Language Patches"β19Updated 3 years ago
- β38Updated last year