xlang-ai/Spider2-V

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xlang-ai/Spider2-V)

xlang-ai / Spider2-V

[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

☆153

Alternatives and similar repositories for Spider2-V

Users that are interested in Spider2-V are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kiaia / GIRAFFE
View on GitHub
Extending context length of visual language models
☆12Dec 18, 2024Updated last year
chang-github-00 / LLM-Predictive-Decoding
View on GitHub
☆16Jul 9, 2025Updated last year
HKUNLP / subgoal-theorem-prover
View on GitHub
Code for the paper "Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving"
☆20May 25, 2023Updated 3 years ago
zhaoxlpku / DynaAct
View on GitHub
☆15Nov 12, 2025Updated 8 months ago
HKUNLP / SymGen
View on GitHub
[EMNLP'23] Code for Generating Data for Symbolic Language with Large Language Models
☆18Oct 21, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
google-research / arcade-nl2code
View on GitHub
☆55Aug 25, 2023Updated 2 years ago
zhaoxlpku / SubgoalXL
View on GitHub
☆26Aug 23, 2024Updated last year
xlang-ai / EVOR
View on GitHub
☆70Dec 15, 2024Updated last year
xlang-ai / aguvis
View on GitHub
[ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
☆389Mar 7, 2025Updated last year
JHU-CLSP / turking-bench
View on GitHub
Web-grounded natural language instructions
☆18Nov 25, 2024Updated last year
OpenLemur / Lemur
View on GitHub
[ICLR 2024] Lemur: Open Foundation Models for Language Agents
☆558Oct 28, 2023Updated 2 years ago
zhongwanjun / CARP
View on GitHub
code for the table-based open domain question answering project, with paper title: "Reasoning over Hybrid Chain for Table-and-Text Open D…
☆12Sep 16, 2022Updated 3 years ago
Timothyxxx / KVCachePapers
View on GitHub
☆20May 24, 2024Updated 2 years ago
qtli / GSM-Plus
View on GitHub
GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.
☆66Jul 8, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
HazyResearch / wonderbread
View on GitHub
WONDERBREAD benchmark + dataset for BPM tasks
☆35Jul 30, 2025Updated 11 months ago
Timothyxxx / EnvInteractiveLMPapers
View on GitHub
Paper collections of methods that using language to interact with environment, including interact with real world, simulated world or WWW…
☆128Jul 26, 2023Updated 3 years ago
shuaichenchang / prompt-text-to-sql
View on GitHub
☆29Aug 18, 2023Updated 2 years ago
OS-Copilot / OS-Sentinel
View on GitHub
[ACL 2026] Code, benchmark and environment for "OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic…
☆49Jul 5, 2026Updated 3 weeks ago
xlang-ai / Binder
View on GitHub
[ICLR 2023] Code for the paper "Binding Language Models in Symbolic Languages"
☆326Aug 25, 2023Updated 2 years ago
xlang-ai / xlang-paper-reading
View on GitHub
Paper collection on building and evaluating language model agents via executable language grounding
☆364Apr 29, 2024Updated 2 years ago
xlang-ai / OSWorld-G
View on GitHub
[NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis
☆172Jun 18, 2026Updated last month
xlang-ai / AgentTrek
View on GitHub
[ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorials
☆60Feb 21, 2025Updated last year
zhxieml / remiss-jailbreak
View on GitHub
☆33Jun 24, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
microsoft / text-to-sql-schema-expansion-generalization
View on GitHub
Bridging the Generalization Gap in Text-to-SQL Parsing with Schema Expansion
☆13Jul 26, 2023Updated 3 years ago
xlang-ai / UnifiedSKG
View on GitHub
[EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with language models
☆566Aug 22, 2023Updated 2 years ago
InfiAgent / InfiAgent
View on GitHub
InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024)
☆199May 29, 2025Updated last year
ranpox / openreview-visualization
View on GitHub
OpenReivew Submission Visualization (ICLR 2024/2025)
☆153Oct 17, 2024Updated last year
xlang-ai / BRIGHT
View on GitHub
[ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
☆210Sep 13, 2025Updated 10 months ago
XinyuanLu00 / SciTab
View on GitHub
The project page for "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables"
☆23Dec 21, 2023Updated 2 years ago
DreamLM / Dream-VLX
View on GitHub
Dream-VL and Dream-VLA, a diffusion VLM and a diffusion VLA.
☆114Jan 14, 2026Updated 6 months ago
quge2023 / TA-SQL
View on GitHub
☆60Nov 18, 2024Updated last year
allenai / noncompliance
View on GitHub
This repository contains data, code and models for contextual noncompliance.
☆26Jul 18, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
koalazf99 / nanoverl
View on GitHub
Collections of RLxLM experiments using minimal codes
☆14Feb 17, 2025Updated last year
HKUNLP / STRING
View on GitHub
[ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"
☆82Nov 25, 2024Updated last year
lfy79001 / S3Eval
View on GitHub
[NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models
☆33Jun 10, 2024Updated 2 years ago
HKUNLP / HumanPrompt
View on GitHub
A framework for human-readable prompt-based method with large language models. Specially designed for researchers. (Deprecated, check out…
☆131Feb 25, 2023Updated 3 years ago
HKUNLP / critic-rl
View on GitHub
[ICML 2025] Teaching Language Models to Critique via Reinforcement Learning
☆127May 6, 2025Updated last year
RUCBM / GUICourse
View on GitHub
GUICourse: From General Vision Langauge Models to Versatile GUI Agents
☆143Mar 1, 2026Updated 4 months ago
xlang-ai / text2reward
View on GitHub
[ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for Reinforcement Learning
☆210Dec 17, 2024Updated last year