microsoft / VisEval
☆29Updated 3 weeks ago
Alternatives and similar repositories for VisEval:
Users that are interested in VisEval are comparing it to the libraries listed below
- ☆58Updated 6 months ago
- InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024)☆104Updated last month
- ☆45Updated 3 months ago
- AFlow & MathAI☆17Updated 2 weeks ago
- ☆42Updated last month
- The Official Code Repository for GUI-World.☆46Updated last month
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆40Updated 2 months ago
- Vega-Lite Chart Dataset and NL Generation Framework using LLMs☆109Updated 8 months ago
- ☆49Updated 4 months ago
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆126Updated 8 months ago
- ☆75Updated 3 years ago
- ncNet is a Transformer-based model for supporting NL2VIS.☆37Updated 4 months ago
- ☆25Updated 2 months ago
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…☆38Updated 2 weeks ago
- The code and data of DPA-RAG☆55Updated last week
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System"☆49Updated 2 months ago
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents☆71Updated 2 months ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆40Updated 2 months ago
- PGRAG☆45Updated 6 months ago
- [NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?☆116Updated 5 months ago
- Code and data for QueryAgent(ACL 2024)☆21Updated last month
- ☆54Updated last month
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆173Updated 4 months ago
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆104Updated last month
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆19Updated 9 months ago
- Code for the 2024 arXiv publication "Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Mo…☆23Updated 6 months ago
- ☆37Updated 2 months ago
- ☆37Updated 4 months ago
- This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"☆45Updated 6 months ago