yangzhch6 / ReSocraticLinks
OptiBench and ReSocratic Synthesis Method
☆25Updated 5 months ago
Alternatives and similar repositories for ReSocratic
Users that are interested in ReSocratic are comparing it to the libraries listed below
Sorting:
- The LLMOPT project offers a comprehensive set of resources, including the model, dataset, training framework, and inference code, enablin…☆76Updated 4 months ago
- Official implementation of the paper "Chain-of-Experts: When LLMs Meet Complex Operation Research Problems"☆103Updated 6 months ago
- Benchmarking Language Model Agents in Algorithm Search for Combinatorial Optimization☆25Updated 3 weeks ago
- Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic Optimization for Prompting LLMs.☆247Updated last year
- Enable Comprehensive LLM Evaluation on Graph Reasoning☆73Updated 2 months ago
- [ICLR 2025] Official code of "Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization"☆17Updated last year
- ML4CO-Bench-101: Benchmark Machine Learning for Classic Combinatorial Problems on Graphs.☆22Updated 3 months ago
- RLVR for LLMs in optimization modeling☆15Updated this week
- [ICML'24 Oral] Rethinking Post-Hoc Search-Based Neural Approaches for Solving Large-Scale Traveling Salesman Problems☆37Updated 4 months ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆86Updated last year
- [NeurIPS 2024] ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution☆205Updated 3 months ago
- ORLM: Training Large Language Models for Optimization Modeling☆180Updated 5 months ago
- ☆23Updated last year
- Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"☆72Updated 2 months ago
- A Python toolkit for Machine Learning (ML) practices for Combinatorial Optimization (CO).☆57Updated 3 weeks ago
- ☆11Updated 11 months ago
- Natural Language for Optimization Modelling☆55Updated 2 months ago
- LLM-Based Human-Agent Collaboration and Interaction Systems: A Survey | Awesome Human-Agent Collaboration | Human-AI Collaboration☆121Updated last week
- [ACL 2024] ValueBench: Towards Comprehensively Evaluating Value Orientations and Understanding of Large Language Models☆22Updated 7 months ago
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet☆167Updated 2 months ago
- Repo for Anonymous purpose, pls don't distribute☆10Updated 11 months ago
- code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning☆41Updated last year
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".☆25Updated 10 months ago
- Benchmarking LLMs' Gaming Ability in Multi-Agent Environments☆87Updated 4 months ago
- ☆42Updated 4 months ago
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆108Updated last year
- This my attempt to create Self-Correcting-LLM based on the paper Training Language Models to Self-Correct via Reinforcement Learning by g…☆35Updated last month
- An index of algorithms for reinforcement learning from human feedback (rlhf))☆93Updated last year
- OR-LLM-Agent: Automating Modeling and Solving of Operations Research Optimization Problems with Reasoning LLM☆46Updated this week
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆189Updated 4 months ago