metal-chart-generation / metalLinks
☆34Updated 3 weeks ago
Alternatives and similar repositories for metal
Users that are interested in metal are comparing it to the libraries listed below
Sorting:
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆43Updated last month
- ☆24Updated 9 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆56Updated 2 months ago
- Verifiers for LLM Reinforcement Learning☆60Updated 2 months ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆90Updated last month
- ☆13Updated 6 months ago
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆51Updated 6 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆93Updated last week
- Code for paper called Self-Training Elicits Concise Reasoning in Large Language Models☆34Updated 2 months ago
- ☆47Updated 3 weeks ago
- ☆45Updated last month
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆62Updated 3 weeks ago
- ☆29Updated 2 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 9 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆17Updated last month
- Repo for "Z1: Efficient Test-time Scaling with Code"☆60Updated 2 months ago
- ☆65Updated 2 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆108Updated 8 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆44Updated 3 months ago
- Process Reward Models That Think☆41Updated 3 weeks ago
- ☆20Updated 10 months ago
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"☆57Updated 8 months ago
- [ACL 2025] Agentic Knowledgeable Self-awareness☆71Updated last week
- Scaling Computer-Use Grounding via UI Decomposition and Synthesis☆79Updated this week
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆58Updated last year
- ☆62Updated 11 months ago
- ☆20Updated this week
- Combining Base and Instruction-Tuned Language Models for Better Synthetic Data Generation☆32Updated 4 months ago
- ☆46Updated 4 months ago
- Official Code Release for "Training a Generally Curious Agent"☆25Updated last month