lil-lab / cb2
An NLP research and data collection platform.
☆17Updated 11 months ago
Alternatives and similar repositories for cb2:
Users that are interested in cb2 are comparing it to the libraries listed below
- Tasks for describing differences between text distributions.☆16Updated 6 months ago
- Codebase for LLM story generation; updated version of https//github.com/yangkevin2/doc-story-generation☆76Updated last year
- Alignment with a millennium of moral progress. Spotlight@NeurIPS 2024 Track on Datasets and Benchmarks.☆20Updated last month
- Byte-sized text games for code generation tasks on virtual environments☆19Updated 7 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!☆51Updated 2 weeks ago
- LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task☆41Updated last month
- Code/data for MARG (multi-agent review generation)☆38Updated 3 months ago
- Evaluating the Moral Beliefs Encoded in LLMs☆23Updated 2 months ago
- ☆42Updated this week
- Repository for Skill Set Optimization☆12Updated 6 months ago
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆106Updated 3 months ago
- The Prism Alignment Project☆66Updated 9 months ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆57Updated 9 months ago
- Official implementation of paper "Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment" (https://arxiv.or…☆21Updated this week
- ☆27Updated this week
- ☆23Updated 5 months ago
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)☆186Updated this week
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆42Updated last year
- Source code and data for The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code (Findings of ACL 2023…☆29Updated last year
- ☆9Updated 2 months ago
- SummScreen: A Dataset for Abstractive Screenplay Summarization (ACL 2022)☆35Updated 2 years ago
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment☆38Updated last year
- ☆31Updated last year
- Code and Data for the NAACL 24 paper: MacGyver: Are Large Language Models Creative Problem Solvers?☆25Updated 10 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆29Updated last week
- The KiloGram Tangrams dataset☆54Updated last year
- ☆32Updated last year
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆35Updated this week