hyintell / LLMSymbolicLinks
☆21Updated last year
Alternatives and similar repositories for LLMSymbolic
Users that are interested in LLMSymbolic are comparing it to the libraries listed below
Sorting:
- ☆42Updated last year
- Code for ICML 2024 paper☆34Updated 3 months ago
- implementation of dualformer☆24Updated 9 months ago
- ☆51Updated 10 months ago
- Natural Language Reinforcement Learning☆100Updated 4 months ago
- ☆29Updated 9 months ago
- Process Reward Models That Think☆67Updated 3 weeks ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆38Updated last year
- ☆24Updated 8 months ago
- ☆76Updated last month
- SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning☆171Updated 3 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆35Updated last year
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs☆40Updated last year
- Official repo of paper LM2☆46Updated 10 months ago
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"☆36Updated 2 months ago
- ☆74Updated last month
- ☆19Updated 9 months ago
- Sotopia-RL: Reward Design for Social Intelligence☆46Updated 4 months ago
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme…☆19Updated last year
- ☆27Updated last year
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning☆144Updated 3 months ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆48Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated last year
- ☆34Updated 7 months ago
- official implementation of paper "Process Reward Model with Q-value Rankings"☆65Updated 10 months ago
- WONDERBREAD benchmark + dataset for BPM tasks☆31Updated 4 months ago
- ☆29Updated 2 months ago
- o1 Chain of Thought Examples☆33Updated last year
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆123Updated last year
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆112Updated 4 months ago