epfl-dlab / cc_flowsLinks
The data and implementation for the experiments in the paper "Flows: Building Blocks of Reasoning and Collaborating AI".
☆31Updated last year
Alternatives and similar repositories for cc_flows
Users that are interested in cc_flows are comparing it to the libraries listed below
Sorting:
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 5 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆64Updated 2 years ago
- Advanced Reasoning Benchmark Dataset for LLMs☆46Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆109Updated 10 months ago
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆110Updated 10 months ago
- ☆41Updated last year
- Entailment self-training☆25Updated 2 years ago
- Functional Benchmarks and the Reasoning Gap☆89Updated last year
- ☆44Updated 10 months ago
- ☆44Updated last year
- Based on the tree of thoughts paper☆48Updated 2 years ago
- ☆95Updated 9 months ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Updated 9 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆64Updated 10 months ago
- A repository for transformer critique learning and generation☆88Updated last year
- Finding semantically meaningful and accurate prompts.☆48Updated last year
- Probabilistic LLM evaluations. [CogSci2023; ACL2023]☆72Updated last year
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆20Updated last year
- Multi-Domain Expert Learning☆66Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated 2 years ago
- Aioli: A unified optimization framework for language model data mixing☆27Updated 8 months ago
- Evaluating LLMs with CommonGen-Lite☆91Updated last year
- Code repository for the c-BTM paper☆107Updated 2 years ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- ☆78Updated 6 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆28Updated 2 years ago
- ☆39Updated last year
- Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22☆66Updated 2 years ago
- Experiments for efforts to train a new and improved t5☆75Updated last year
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆107Updated 2 years ago