epfl-dlab / cc_flowsLinks
The data and implementation for the experiments in the paper "Flows: Building Blocks of Reasoning and Collaborating AI".
☆31Updated last year
Alternatives and similar repositories for cc_flows
Users that are interested in cc_flows are comparing it to the libraries listed below
Sorting:
- Finding semantically meaningful and accurate prompts.☆46Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆56Updated 5 months ago
- ☆40Updated 10 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆32Updated last month
- Advanced Reasoning Benchmark Dataset for LLMs☆46Updated last year
- [NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind☆67Updated last year
- ☆44Updated 6 months ago
- Functional Benchmarks and the Reasoning Gap☆86Updated 8 months ago
- ☆15Updated 2 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- A repository for transformer critique learning and generation☆90Updated last year
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆54Updated last year
- Retrieval Augmented Generation Generalized Evaluation Dataset☆53Updated 6 months ago
- ☆22Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆65Updated last year
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆106Updated 6 months ago
- Entailment self-training☆25Updated 2 years ago
- Based on the tree of thoughts paper☆48Updated last year
- ☆24Updated 9 months ago
- A set of utilities for running few-shot prompting experiments on large-language models☆121Updated last year
- The data and the PyTorch implementation for the models and experiments in the paper "Language Model Decoding as Likelihood–Utility Alignm…☆14Updated last year
- Experiments for efforts to train a new and improved t5☆77Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆104Updated 5 months ago
- A library for squeakily cleaning and filtering language datasets.☆47Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year
- ☆29Updated last year
- ☆36Updated 2 years ago
- Reasoning by Communicating with Agents☆28Updated last month
- Public Inflection Benchmarks☆68Updated last year