lil-lab / cb2Links
An NLP research and data collection platform.
☆17Updated last year
Alternatives and similar repositories for cb2
Users that are interested in cb2 are comparing it to the libraries listed below
Sorting:
- Codebase for LLM story generation; updated version of https//github.com/yangkevin2/doc-story-generation☆85Updated last year
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆110Updated 11 months ago
- Byte-sized text games for code generation tasks on virtual environments☆20Updated last year
- ☆14Updated 6 months ago
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)☆251Updated 2 weeks ago
- The repository contains the code and dataset for the Socratic Debugging task which is a novel task for Socratically Questioning Novice De…☆18Updated last year
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆78Updated last year
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆87Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆28Updated 2 years ago
- Using conversational games to evaluate powerful LLMs☆17Updated 2 years ago
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion☆57Updated last year
- A collection of works that investigate social agents, simulations and their real-world impact in text, embodied, and robotics contexts.☆96Updated last year
- LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task☆47Updated 9 months ago
- distilled Self-Critique refines the outputs of a LLM with only synthetic data☆11Updated last year
- List of papers on Self-Correction of LLMs.☆78Updated 9 months ago
- ☆46Updated last year
- This repo contains code for our NeurIPS 2023 spotlight paper: Evaluating and Inducing Personality in Pre-trained Language Models☆55Updated last year
- Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback☆207Updated 2 years ago
- Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"☆20Updated 3 weeks ago
- [NAACL 2024] Better Zero-Shot Reasoning with Role-Play Prompting☆36Updated last year
- SummScreen: A Dataset for Abstractive Screenplay Summarization (ACL 2022)☆38Updated 3 years ago
- A corpus and code for understanding norms and subjectivity. 🤖☆52Updated last year
- Code for "From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Mod…☆40Updated last year
- Alignment with a millennium of moral progress. Spotlight@NeurIPS 2024 Track on Datasets and Benchmarks.☆24Updated 6 months ago
- KokoMind: Can LLMs Understand Social Interactions?☆102Updated 2 years ago
- Super fast implementations of common benchmark text world games☆51Updated last month
- This repository contains a LLM benchmark for the social deduction game `Resistance Avalon'☆128Updated 4 months ago
- [EMNLP '23] Discriminator-Guided Chain-of-Thought Reasoning☆49Updated last year
- Language Models of Code are Few-Shot Commonsense Learners (EMNLP 2022)☆85Updated 2 years ago
- ☆99Updated last year