zorazrw / workflow-induction-toolkitLinks
A toolkit to induce interpretable workflows from raw computer-use activities.
☆25Updated last week
Alternatives and similar repositories for workflow-induction-toolkit
Users that are interested in workflow-induction-toolkit are comparing it to the libraries listed below
Sorting:
- Codebase accompanying the Summary of a Haystack paper.☆79Updated last year
- ☆23Updated 8 months ago
- LangCode - Improving alignment and reasoning of large language models (LLMs) with natural language embedded program (NLEP).☆47Updated 2 years ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 9 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆101Updated 11 months ago
- Learning to route instances for Human vs AI Feedback (ACL Main '25)☆25Updated 3 months ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award …☆42Updated last year
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆27Updated 11 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated last year
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆91Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- ☆129Updated last year
- Verifiers for LLM Reinforcement Learning☆79Updated 7 months ago
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.☆44Updated 7 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆72Updated this week
- ☆81Updated this week
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 6 months ago
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated 2 years ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆105Updated 6 months ago
- Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆54Updated 3 months ago
- Aioli: A unified optimization framework for language model data mixing☆28Updated 9 months ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆99Updated 2 years ago
- This repository contains ScholarQABench data and evaluation pipeline.☆85Updated 3 months ago
- ☆50Updated 5 months ago
- ☆55Updated last year
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆51Updated last year
- Code accompanying "How I learned to start worrying about prompt formatting".☆110Updated 5 months ago
- ☆49Updated 7 months ago
- List of papers on Self-Correction of LLMs.☆80Updated 10 months ago
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆102Updated 3 months ago