InfiAgent/InfiAgent

InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024)

☆198

Alternatives and similar repositories for InfiAgent

Users who are interested in InfiAgent are comparing it to the repositories listed below.

behavioral-data / BLADE
[EMNLP 2024 Findings] Benchmarking Language Model Agents for Data-Driven Science
☆35 · Oct 25, 2024 · Updated last year
MetaCopilot / dseval
☆33 · Jun 24, 2024 · Updated 2 years ago
xlang-ai / Spider2-V
[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
☆153 · Aug 26, 2024 · Updated last year
shirley-wu / daco
[NeurIPS 2024 D&B Track] DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation
☆14 · Mar 5, 2025 · Updated last year
guosyjlu / DS-Agent
Official implementation of "DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning" in ICML'24
☆238 · Dec 3, 2024 · Updated last year
XiangJinyu / APrompt
An automatic prompt iteration and optimization generator suitable for any scenario
☆16 · Jan 31, 2025 · Updated last year
InfiXAI / InfiGUIAgent
☆74 · May 23, 2025 · Updated last year
Shenzhi-Wang / recon
The official source code for "Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling" (ACL 2024, Findings)
☆15 · Aug 12, 2024 · Updated last year
open-compass / CIBench
Official Repo of "CIBench: Evaluation of LLMs as Code Interpreter"
☆15 · Jul 19, 2024 · Updated 2 years ago
Gentopia-AI / Gentopia
Build Hierarchical Autonomous Agents through Config. Collaborative Growth of Specialized Agents.
☆328 · Nov 27, 2023 · Updated 2 years ago
OS-Agent-Survey / OS-Agent-Survey
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).
☆486 · Aug 16, 2025 · Updated 11 months ago
SteveKGYang / MetaAligner
Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models
☆24 · Sep 26, 2024 · Updated last year
LeapLabTHU / FamO2O
Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)
☆41 · Oct 30, 2023 · Updated 2 years ago
hurunyi / VideoShield
[ICLR 2025] VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking (Official Implementation)
☆56 · May 30, 2025 · Updated last year
LeapLabTHU / diver-ct
☆14 · Dec 19, 2024 · Updated last year
SWE-Gym / SWE-Gym
Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]
☆708 · Jul 29, 2025 · Updated 11 months ago
ridgesai / ridges-old
☆12 · May 30, 2025 · Updated last year
zorazrw / agent-skill-induction
Agent Skill Induction: "Inducing Programmatic Skills for Agentic Tasks"
☆42 · Apr 24, 2025 · Updated last year
microsoft / VisEval
A benchmark designed to evaluate visualization generation methods.
☆60 · Updated this week
snap-stanford / MLAgentBench
☆346 · Jun 19, 2024 · Updated 2 years ago
Reason-Wang / NAT
[NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…
☆28 · Mar 14, 2024 · Updated 2 years ago
HKUSTDial / ChartInsights
Official repository for the paper “ChartInsights: Evaluating Multimodal Large Language Models for Low-Level Chart Question Answering” (EMN…
☆22 · Nov 16, 2024 · Updated last year
MASWorks / ML-Agent
The official implementation of "ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering"
☆69 · Jun 21, 2025 · Updated last year
SimengSun / ChapterBreak
☆12 · Jun 5, 2024 · Updated 2 years ago
hurunyi / Robust-Wide
[ECCV 2024] Robust-Wide: Robust Watermarking against Instruction-driven Image Editing (Official Implementation)
☆38 · May 30, 2025 · Updated last year
multi-swe-bench / MagentLess
☆13 · Jul 31, 2025 · Updated 11 months ago
ZJU-CTAG / B4
Code for ASE'24 paper "B4: Towards Optimal Assessment of Plausible Code Solutions with Plausible Tests"
☆11 · Sep 10, 2024 · Updated last year
LiqiangJing / DSBench
[ICLR 2025] DSBench: How Far are Data Science Agents from Becoming Data Science Experts?
☆125 · Aug 17, 2025 · Updated 11 months ago
ADaM-BJTU / O1-CODER
AN O1 REPLICATION FOR CODING
☆332 · Dec 11, 2024 · Updated last year
StyxXuan / LoraRetriever
☆17 · Apr 29, 2025 · Updated last year
SalesforceAIResearch / xLAM
xLAM: A Family of Large Action Models to Empower AI Agent Systems
☆634 · Jun 2, 2026 · Updated last month
OSU-NLP-Group / GUI-Agents-Paper-List
Awesome GUI Agent Paper List
☆861 · Jun 28, 2026 · Updated 3 weeks ago
X-LANCE / text2sql-multiturn-GPT
[NAACL 2024] CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions
☆13 · May 7, 2024 · Updated 2 years ago
zhao-ht / LearnAct
Code for paper Empowering Large Language Model Agents through Action Learning
☆34 · Aug 8, 2024 · Updated last year
LR32768 / DL_theory_exp
☆16 · Apr 12, 2024 · Updated 2 years ago
Alibaba-Quark / SSP
Search Self-Play: Pushing the Frontier of Agent Capability without Supervision
☆103 · Mar 4, 2026 · Updated 4 months ago
zhichaoxu-shufe / context-aware-decoding-qfs
☆14 · Jan 10, 2024 · Updated 2 years ago
princeton-nlp / WhatICLLearns
[ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning
☆21 · Jul 9, 2023 · Updated 3 years ago
THUDM / WebRL
Building Open LLM Web Agents with Self-Evolving Online Curriculum RL
☆535 · Jun 6, 2025 · Updated last year