lil-lab / cb2Links
An NLP research and data collection platform.
☆17Updated last year
Alternatives and similar repositories for cb2
Users that are interested in cb2 are comparing it to the libraries listed below
Sorting:
- Codebase for LLM story generation; updated version of https//github.com/yangkevin2/doc-story-generation☆87Updated 2 years ago
- ☆47Updated last year
- Evaluating the Moral Beliefs Encoded in LLMs☆31Updated last year
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion☆59Updated last year
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆90Updated last year
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆111Updated last year
- SummScreen: A Dataset for Abstractive Screenplay Summarization (ACL 2022)☆39Updated 3 years ago
- ☆15Updated last month
- distilled Self-Critique refines the outputs of a LLM with only synthetic data☆11Updated last year
- KokoMind: Can LLMs Understand Social Interactions?☆104Updated 2 years ago
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement)☆50Updated 2 years ago
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)☆272Updated last week
- Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback☆208Updated 2 years ago
- ☆100Updated last year
- Byte-sized text games for code generation tasks on virtual environments☆20Updated last year
- ☆214Updated 2 years ago
- [NAACL 2024] Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models☆86Updated last year
- For experiments involving instruct gpt. Currently used for documenting open research questions.☆71Updated 3 years ago
- Super fast implementations of common benchmark text world games☆52Updated 4 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated 2 years ago
- A set of utilities for running few-shot prompting experiments on large-language models☆126Updated 2 years ago
- ⚡Research papers about leveraging the capabilities of language models⚡☆52Updated 2 years ago
- A collection of works that investigate social agents, simulations and their real-world impact in text, embodied, and robotics contexts.☆106Updated last year
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆81Updated last year
- Code accompanying our EMNLP 2019 paper: "Revisiting the Evaluation of Theory of Mind through Question Answering"☆26Updated 5 years ago
- Data and code for the paper "Inducing Positive Perspectives with Text Reframing"☆61Updated 2 years ago
- The Prism Alignment Project☆87Updated last year
- Machine Theory of Mind Reading List. Built upon EMNLP Findings 2023 Paper: Towards A Holistic Landscape of Situated Theory of Mind in Lar…☆149Updated 10 months ago
- ZYN: Zero-Shot Reward Models with Yes-No Questions☆35Updated 2 years ago
- [EMNLP '23] Discriminator-Guided Chain-of-Thought Reasoning☆50Updated last year