microsoft / VisEvalLinks
A benchmark designed to evaluate visualization generation methods.
☆52Updated 2 weeks ago
Alternatives and similar repositories for VisEval
Users that are interested in VisEval are comparing it to the libraries listed below
Sorting:
- ☆96Updated last year
- Awesome-Paper-list: Visualization meets LLM☆56Updated last month
- InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024)☆160Updated 5 months ago
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agents☆177Updated 2 weeks ago
- ☆69Updated 5 months ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆92Updated last year
- LLM-Based Human-Agent Collaboration and Interaction Systems: A Survey | Awesome Human-Agent Collaboration | Human-AI Collaboration☆163Updated last week
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆133Updated 9 months ago
- ☆60Updated 6 months ago
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet☆209Updated last week
- ☆158Updated 3 weeks ago
- ☆84Updated 4 years ago
- 🔥🔥🔥Latest Papers, Codes on Uncertainty-based RL☆51Updated 2 months ago
- LLM for Scientific Research Survey☆113Updated 9 months ago
- [ACL'25 Main] ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation☆65Updated last week
- ☆134Updated last month
- Test-time preferenece optimization (ICML 2025).☆169Updated 6 months ago
- A research repo for experiments about Reinforcement Finetuning☆52Updated 7 months ago
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents☆128Updated 7 months ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆86Updated 9 months ago
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆137Updated 5 months ago
- ☆419Updated 3 months ago
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆180Updated last year
- Resources on Large Language Models for Table Processing☆110Updated last year
- 🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.☆316Updated 3 months ago
- The implementation for ICLR 2025 Oral: From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions.☆49Updated 3 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆289Updated 3 weeks ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆131Updated 9 months ago
- ☆131Updated 8 months ago
- Accepted LLM Papers in NeurIPS 2024☆37Updated last year