epfl-dlab / cc_flowsLinks
The data and implementation for the experiments in the paper "Flows: Building Blocks of Reasoning and Collaborating AI".
☆31Updated last year
Alternatives and similar repositories for cc_flows
Users that are interested in cc_flows are comparing it to the libraries listed below
Sorting:
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆58Updated 6 months ago
- The data and the PyTorch implementation for the models and experiments in the paper "Language Model Decoding as Likelihood–Utility Alignm…☆14Updated last year
- Finding semantically meaningful and accurate prompts.☆46Updated last year
- Pseudo-code Instructions dataset☆27Updated last year
- Advanced Reasoning Benchmark Dataset for LLMs☆47Updated last year
- Factored Cognition Primer: How to write compositional language model programs☆49Updated 2 years ago
- ☆15Updated 2 months ago
- Code repository for the c-BTM paper☆106Updated last year
- ☆29Updated 2 years ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆48Updated last year
- Entailment self-training☆25Updated 2 years ago
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆106Updated 7 months ago
- A repository for transformer critique learning and generation☆90Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆32Updated 2 months ago
- ☆94Updated 6 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆104Updated 6 months ago
- Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22☆66Updated 2 years ago
- ☆43Updated 2 months ago
- A set of utilities for running few-shot prompting experiments on large-language models☆121Updated last year
- Reasoning by Communicating with Agents☆29Updated last month
- ☆35Updated 2 years ago
- ☆76Updated 3 months ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆54Updated last year
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆70Updated 2 years ago
- [NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind☆67Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- An implementation of Deepmind's Promptbreeder.☆22Updated last year
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"☆59Updated 5 months ago
- ☆51Updated 8 months ago
- ☆36Updated 2 years ago