yangzhch6 / ReSocraticLinks
OptiBench and ReSocratic Synthesis Method
☆30Updated 3 months ago
Alternatives and similar repositories for ReSocratic
Users that are interested in ReSocratic are comparing it to the libraries listed below
Sorting:
- The LLMOPT project offers a comprehensive set of resources, including the model, dataset, training framework, and inference code, enablin…☆119Updated 2 months ago
- Official implementation of the paper "Chain-of-Experts: When LLMs Meet Complex Operation Research Problems"☆112Updated 11 months ago
- A collection of LLMs for optimization, including modeling and solving☆37Updated 4 months ago
- [AAAI 2026] Benchmarking Language Model Agents in Algorithm Search for Combinatorial Optimization☆34Updated 5 months ago
- Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic Optimization for Prompting LLMs.☆252Updated last year
- RLVR for LLMs in optimization modeling☆39Updated last month
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆94Updated last year
- Reinforced Multi-LLM Agents training☆66Updated 7 months ago
- Natural Language for Optimization Modelling☆65Updated 7 months ago
- ☆39Updated last year
- ☆32Updated last year
- Code for paper: End-to-end Stochastic Optimization with Energy-based Model☆17Updated 2 years ago
- ☆43Updated last month
- code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning☆43Updated last year
- This my attempt to create Self-Correcting-LLM based on the paper Training Language Models to Self-Correct via Reinforcement Learning by g…☆38Updated 6 months ago
- Repo of "Large Language Model-based Human-Agent Collaboration for Complex Task Solving(EMNLP2024 Findings)"☆34Updated last year
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆115Updated last year
- [ICLR 2025] Official code of "Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization"☆18Updated last year
- [ACL 2024] ValueBench: Towards Comprehensively Evaluating Value Orientations and Understanding of Large Language Models☆24Updated last year
- Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"☆78Updated 7 months ago
- Direct preference optimization with f-divergences.☆15Updated last year
- ☆12Updated 10 months ago
- Enable Comprehensive LLM Evaluation on Graph Reasoning☆73Updated 7 months ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents☆38Updated last year
- ORLM: Training Large Language Models for Optimization Modeling☆229Updated 4 months ago
- ☆74Updated last year
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".☆29Updated last year
- Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)☆199Updated 2 years ago
- [NeurIPS 2024] GITA: Graph to Image-Text Integration for Vision-Language Graph Reasoning☆53Updated last month
- [ICML 2025] "From Debate to Equilibrium: Belief-Driven Multi-Agent LLM Reasoning via Bayesian Nash Equilibrium"☆33Updated last month