microsoft / VisEvalLinks
A benchmark designed to evaluate visualization generation methods.
☆55Updated 2 months ago
Alternatives and similar repositories for VisEval
Users that are interested in VisEval are comparing it to the libraries listed below
Sorting:
- ☆102Updated last year
- Awesome-Paper-list: Visualization meets LLM☆61Updated 3 weeks ago
- ☆186Updated 3 months ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆103Updated last year
- ☆97Updated 9 months ago
- ☆70Updated 7 months ago
- InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024)☆178Updated 7 months ago
- [ACL'25 Main] ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation☆74Updated last month
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agents☆276Updated last month
- LLM-Based Human-Agent Collaboration and Interaction Systems: A Survey | Awesome Human-Agent Collaboration | Human-AI Collaboration☆180Updated last month
- ☆218Updated 2 weeks ago
- Test-time preferenece optimization (ICML 2025).☆177Updated 8 months ago
- [ICLR 2025] DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆99Updated 5 months ago
- LLM for Scientific Research Survey☆118Updated 11 months ago
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆162Updated last year
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet☆234Updated 2 months ago
- A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"☆188Updated last month
- ☆212Updated 5 months ago
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆192Updated last year
- ☆102Updated 2 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆82Updated 2 months ago
- The implementation for ICLR 2025 Oral: From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions.☆52Updated 5 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆150Updated last year
- The code of RouterDC☆69Updated 9 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆306Updated 2 weeks ago
- ☆458Updated 5 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆93Updated last year
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language …☆166Updated 8 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆142Updated 11 months ago
- ☆35Updated last year