xiusic / DecisionFlowLinks
☆29Updated 2 months ago
Alternatives and similar repositories for DecisionFlow
Users that are interested in DecisionFlow are comparing it to the libraries listed below
Sorting:
- The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆86Updated 3 weeks ago
- ☆66Updated 4 months ago
- II-Thought-RL is our initial attempt at developing a large-scale, multi-domain Reinforcement Learning (RL) dataset☆26Updated 4 months ago
- ☆40Updated 7 months ago
- Official Repository for Task-Circuit Quantization☆22Updated 2 months ago
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"☆56Updated 5 months ago
- Analysis code for paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆45Updated this week
- Source code and utilities for the Genesys distributed language model architecture discovery system.☆47Updated last month
- The Library for LLM-based multi-agent applications☆92Updated 3 weeks ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 6 months ago
- ☆59Updated 8 months ago
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆61Updated last month
- The official repo for the code and data of paper SMART☆31Updated 5 months ago
- Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"☆14Updated 2 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- ScreenSuite - The most comprehensive benchmarking suite for GUI Agents!☆100Updated last week
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 6 months ago
- [ICLR'25] ApolloMoE: Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts☆46Updated 8 months ago
- accompanying material for sleep-time compute paper☆102Updated 3 months ago
- ☆19Updated 5 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours☆63Updated 11 months ago
- ☆24Updated 10 months ago
- ☆20Updated 2 months ago
- Verifiers for LLM Reinforcement Learning☆69Updated 3 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆59Updated 8 months ago
- LLMs as Collaboratively Edited Knowledge Bases☆45Updated last year
- One Line To Build Zero-Data Classifiers in Minutes☆58Updated 10 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆120Updated 9 months ago