lil-lab / cb2
An NLP research and data collection platform.
☆17Updated last year
Alternatives and similar repositories for cb2:
Users that are interested in cb2 are comparing it to the libraries listed below
- Evaluating the Moral Beliefs Encoded in LLMs☆25Updated 4 months ago
- Super fast implementations of common benchmark text world games☆47Updated last month
- ☆33Updated last month
- Byte-sized text games for code generation tasks on virtual environments☆19Updated 9 months ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆62Updated 11 months ago
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment☆38Updated last year
- ☆27Updated last month
- [EMNLP 2020] Keep CALM and Explore: Language Models for Action Generation in Text-based Games☆71Updated 4 years ago
- Codebase for LLM story generation; updated version of https//github.com/yangkevin2/doc-story-generation☆80Updated last year
- LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task☆42Updated 3 months ago
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆106Updated 5 months ago
- The source code for running LLMs on the AAAR-1.0 benchmark.☆16Updated 3 weeks ago
- 🧪Create domain-adapted language models by distilling from many pre-trained LMs☆10Updated 2 years ago
- This repo contains code for our NeurIPS 2023 spotlight paper: Evaluating and Inducing Personality in Pre-trained Language Models☆52Updated last year
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!☆51Updated last month
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod…☆14Updated last year
- [ICML 2024] Language Models Represent Beliefs of Self and Others☆32Updated 7 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- For experiments involving instruct gpt. Currently used for documenting open research questions.☆71Updated 2 years ago
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)☆19Updated 2 months ago
- Code for "From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Mod…☆36Updated last year
- Source code and data for The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code (Findings of ACL 2023…☆29Updated last year
- [EMNLP 2023, Findings] GRACE: Discriminator-Guided Chain-of-Thought Reasoning☆47Updated 6 months ago
- Source code for the GPT-2 story generation models in the EMNLP 2020 paper "STORIUM: A Dataset and Evaluation Platform for Human-in-the-Lo…☆39Updated last year
- ☆41Updated 3 weeks ago
- Alignment with a millennium of moral progress. Spotlight@NeurIPS 2024 Track on Datasets and Benchmarks.☆22Updated 3 weeks ago
- Generate sentences from a probabilistic context-free grammar.☆16Updated 5 months ago
- Code/data for MARG (multi-agent review generation)☆42Updated 5 months ago
- SummScreen: A Dataset for Abstractive Screenplay Summarization (ACL 2022)☆37Updated 2 years ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆45Updated last year