yangzhch6 / ReSocraticLinks
OptiBench and ReSocratic Synthesis Method
☆23Updated 2 months ago
Alternatives and similar repositories for ReSocratic
Users that are interested in ReSocratic are comparing it to the libraries listed below
Sorting:
- The LLMOPT project offers a comprehensive set of resources, including the model, dataset, training framework, and inference code, enablin…☆64Updated last month
- Official implementation of the paper "Chain-of-Experts: When LLMs Meet Complex Operation Research Problems"☆99Updated 3 months ago
- Benchmarking Language Model Agents in Algorithm Search for Combinatorial Optimization☆19Updated last week
- ICML 2025 Spotlight☆90Updated this week
- ☆28Updated last month
- ☆11Updated 8 months ago
- ORLM: Training Large Language Models for Optimization Modeling☆157Updated 2 months ago
- MARFT stands for Multi-Agent Reinforcement Fine-Tuning. This repository implements an LLM-based multi-agent reinforcement fine-tuning fra…☆35Updated 2 weeks ago
- ☆11Updated 3 months ago
- ☆62Updated 2 months ago
- [ICLR 2025] Official code of "Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization"☆14Updated last year
- Code for "Decision-Focused Learning without Differentiable Optimization: Learning Locally Optimized Decision Losses"☆27Updated last year
- ☆29Updated 2 months ago
- ☆23Updated last year
- Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"☆69Updated 5 months ago
- [ICML'24 Oral] Rethinking Post-Hoc Search-Based Neural Approaches for Solving Large-Scale Traveling Salesman Problems☆37Updated 2 months ago
- Natural Language for Optimization Modelling☆51Updated 5 months ago
- ☆42Updated last month
- [NeurIPS 2024] Official implementation for paper "Can Graph Learning Improve Planning in LLM-based Agents?"☆126Updated 3 weeks ago
- Code for paper: End-to-end Stochastic Optimization with Energy-based Model☆16Updated 2 years ago
- [NeurIPS 2023] T2T: From Distribution Learning in Training to Gradient Search in Testing for Combinatorial Optimization☆63Updated 4 months ago
- ☆114Updated 4 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆120Updated 8 months ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆79Updated 9 months ago
- Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic Optimization for Prompting LLMs.☆244Updated last year
- ☆13Updated last month
- Predict and search framework for MilP☆53Updated 2 years ago
- This my attempt to create Self-Correcting-LLM based on the paper Training Language Models to Self-Correct via Reinforcement Learning by g…☆35Updated 2 months ago
- Code for "Reasoning to Learn from Latent Thoughts"☆104Updated 2 months ago
- [ICML 2024] "MVMoE: Multi-Task Vehicle Routing Solver with Mixture-of-Experts"☆70Updated this week