metal-chart-generation / metalLinks
☆41Updated 7 months ago
Alternatives and similar repositories for metal
Users that are interested in metal are comparing it to the libraries listed below
Sorting:
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆50Updated 8 months ago
- ☆67Updated 9 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆87Updated last month
- ☆23Updated last year
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Updated 7 months ago
- Process Reward Models That Think☆70Updated last month
- Verifiers for LLM Reinforcement Learning☆79Updated 8 months ago
- ☆52Updated 7 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆94Updated 7 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆119Updated 7 months ago
- MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning☆110Updated last month
- ☆50Updated 11 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆101Updated 4 months ago
- When Reasoning Meets Its Laws☆33Updated last week
- LIMI: Less is More for Agency☆156Updated 2 months ago
- ☆63Updated 6 months ago
- Leveraging Base Language Models for Few-Shot Synthetic Data Generation☆40Updated 2 months ago
- JudgeLRM: Large Reasoning Models as a Judge☆40Updated last month
- Official repo of Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains.☆43Updated 7 months ago
- An automated data pipeline scaling RL to pretraining levels☆72Updated 3 months ago
- Official Code Release for "Training a Generally Curious Agent"☆43Updated 7 months ago
- ☆71Updated last year
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433☆116Updated last year
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆140Updated last year
- accompanying material for sleep-time compute paper☆118Updated 8 months ago
- Code for paper called Self-Training Elicits Concise Reasoning in Large Language Models☆41Updated 8 months ago
- ☆226Updated 10 months ago
- The open-source code of MetaStone-S1.☆106Updated 5 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆38Updated last year
- Geometric-Mean Policy Optimization☆96Updated last month