epfl-dlab / cc_flows
The data and implementation for the experiments in the paper "Flows: Building Blocks of Reasoning and Collaborating AI".
☆31Updated last year
Alternatives and similar repositories for cc_flows:
Users that are interested in cc_flows are comparing it to the libraries listed below
- The data and the PyTorch implementation for the models and experiments in the paper "Language Model Decoding as Likelihood–Utility Alignm…☆14Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆54Updated 4 months ago
- Code for our EMNLP '22 paper "Fixing Model Bugs with Natural Language Patches"☆19Updated 2 years ago
- Functional Benchmarks and the Reasoning Gap☆85Updated 6 months ago
- Factored Cognition Primer: How to write compositional language model programs☆48Updated 2 years ago
- Advanced Reasoning Benchmark Dataset for LLMs☆45Updated last year
- Finding semantically meaningful and accurate prompts.☆46Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆64Updated last year
- ☆75Updated 3 weeks ago
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆106Updated 5 months ago
- ☆36Updated 2 years ago
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated last year
- ☆40Updated 9 months ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 3 years ago
- Based on the tree of thoughts paper☆48Updated last year
- ☆29Updated last year
- Code and Data for "Language Modeling with Editable External Knowledge"☆32Updated 10 months ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award …☆39Updated 5 months ago
- A set of utilities for running few-shot prompting experiments on large-language models☆118Updated last year
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆69Updated 2 years ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆53Updated last year
- Probabilistic LLM evaluations. [CogSci2023; ACL2023]☆73Updated 8 months ago
- ☆44Updated 5 months ago
- [NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind☆67Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆33Updated this week
- Language Models of Code are Few-Shot Commonsense Learners (EMNLP 2022)☆86Updated 2 years ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆47Updated last year
- ☆15Updated 2 weeks ago
- Entailment self-training☆25Updated last year
- Aioli: A unified optimization framework for language model data mixing☆23Updated 3 months ago