microsoft / VisEvalLinks
A benchmark designed to evaluate visualization generation methods.
☆44Updated 2 months ago
Alternatives and similar repositories for VisEval
Users that are interested in VisEval are comparing it to the libraries listed below
Sorting:
- ☆82Updated last year
- Awesome-Paper-list: Visualization meets LLM☆45Updated 2 weeks ago
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet☆167Updated 2 months ago
- InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024)☆146Updated 3 months ago
- ☆81Updated 4 years ago
- LLM for Scientific Research Survey☆98Updated 7 months ago
- Awesome Agent Training☆215Updated 3 weeks ago
- ☆81Updated 3 months ago
- ☆67Updated 2 months ago
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆165Updated last year
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆243Updated 2 weeks ago
- ☆80Updated 5 months ago
- Neural Code Intelligence Survey 2024; Reading lists and resources☆266Updated last month
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents☆120Updated 5 months ago
- The implementation for ICLR 2025 Oral: From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions.☆45Updated 3 weeks ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆119Updated 6 months ago
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆148Updated 8 months ago
- A research repo for experiments about Reinforcement Finetuning☆51Updated 4 months ago
- ☆312Updated last month
- Official implementation of the paper Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers☆167Updated 5 months ago
- The code and data of DPA-RAG, accepted by WWW 2025 main conference.☆61Updated 7 months ago
- ☆154Updated 8 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆133Updated 11 months ago
- ☆274Updated 3 months ago
- Test-time preferenece optimization (ICML 2025).☆162Updated 3 months ago
- ☆405Updated last month
- ☆23Updated last year
- Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation".☆244Updated last year
- ncNet is a Transformer-based model for supporting NL2VIS.☆43Updated 11 months ago
- ☆38Updated last year