lil-lab / cb2
An NLP research and data collection platform.
☆17 Updated last year
Alternatives and similar repositories for cb2
Users who are interested in cb2 are comparing it to the repositories listed below.
- ☆15 Updated 7 months ago
- DialOp: Decision-oriented dialogue environments for collaborative language agents ☆110 Updated 11 months ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs. ☆88 Updated last year
- The repository contains the code and dataset for the Socratic Debugging task which is a novel task for Socratically Questioning Novice De… ☆18 Updated last year
- Codebase for LLM story generation; updated version of https://github.com/yangkevin2/doc-story-generation ☆85 Updated last year
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight) ☆258 Updated last month
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024) ☆78 Updated last year
- Byte-sized text games for code generation tasks on virtual environments ☆20 Updated last year
- Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind" ☆20 Updated last month
- Neuron Activation ☆24 Updated 11 months ago
- Super fast implementations of common benchmark text world games ☆51 Updated 2 months ago
- A collection of works that investigate social agents, simulations and their real-world impact in text, embodied, and robotics contexts. ☆98 Updated last year
- LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task ☆47 Updated 10 months ago
- ☆46 Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 Updated 2 years ago
- Evaluating the Moral Beliefs Encoded in LLMs ☆31 Updated 10 months ago
- SummScreen: A Dataset for Abstractive Screenplay Summarization (ACL 2022) ☆38 Updated 3 years ago
- Code for "From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Mod… ☆40 Updated last year
- Language Models of Code are Few-Shot Commonsense Learners (EMNLP 2022) ☆86 Updated 2 years ago
- Source code and data for The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code (Findings of ACL 2023… ☆30 Updated 2 years ago
- [NAACL 2024] Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models ☆86 Updated last year
- Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs ☆13 Updated last year
- ☆29 Updated last year
- Using conversational games to evaluate powerful LLMs ☆18 Updated 2 years ago
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod… ☆14 Updated 2 years ago
- Code accompanying our EMNLP 2019 paper: "Revisiting the Evaluation of Theory of Mind through Question Answering" ☆26 Updated 5 years ago
- distilled Self-Critique refines the outputs of an LLM with only synthetic data ☆11 Updated last year
- [NAACL 2024] Better Zero-Shot Reasoning with Role-Play Prompting ☆36 Updated last year
- ☆100 Updated last year
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion ☆58 Updated last year