[ICLR/AAAI/KDD2026] Open-Source LLM-Based Data Analysis Agents
☆105Jun 8, 2026Updated this week
Alternatives and similar repositories for DataMind
Users that are interested in DataMind are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Aligning Agentic World Models via Knowledgeable Experience Learning☆36May 15, 2026Updated 3 weeks ago
- ☆15Jan 9, 2026Updated 5 months ago
- official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"☆25Oct 11, 2025Updated 8 months ago
- A Structured Grammar for Chart Annotation☆15May 8, 2025Updated last year
- A PTA exported exercise paper format helper which cleans the results.☆12Jan 7, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- awesome LLM papers! 🚀 🚀 🚀☆43Jul 3, 2025Updated 11 months ago
- A holistic framework for advancing LLMs as data science agents☆49May 19, 2026Updated 3 weeks ago
- [NeurIPS 2025] MergeBench: A Benchmark for Merging Domain-Specialized LLMs☆47Feb 11, 2026Updated 4 months ago
- BPfold: Deep generalizable prediction of RNA secondary structure via base pair motif energy.☆35May 27, 2026Updated 2 weeks ago
- The 4th rank system of the SemEval 2021 Task4.☆10May 7, 2022Updated 4 years ago
- ☆26May 26, 2025Updated last year
- [VLDB' 25] Synthesizing High-quality Text-to-SQL Data at Scale. SynSQL-2.5M is the first million-scale cross-domain text-to-SQL dataset.☆450Sep 8, 2025Updated 9 months ago
- MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models☆44Jan 28, 2026Updated 4 months ago
- [NeurIPS'25] Official Repository for the Paper "SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning"☆141Nov 20, 2025Updated 6 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [NeurIPS DB 2025] IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering☆46Oct 15, 2025Updated 7 months ago
- Official repository for Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning☆12Sep 2, 2024Updated last year
- ☆37Oct 9, 2025Updated 8 months ago
- Improving Symbolic Music Generation with Inference-Time Alignment☆22Aug 2, 2025Updated 10 months ago
- Officical repository for the paper“ChartInsights: Evaluating Multimodal Large Language Models for Low-Level Chart Question Answering”(EMN…☆22Nov 16, 2024Updated last year
- Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm o…☆577Jun 4, 2026Updated last week
- [ACL'25] UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench☆36Aug 12, 2025Updated 10 months ago
- Pokemon game using Google Maps☆11Oct 31, 2015Updated 10 years ago
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆26Jul 1, 2025Updated 11 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Implementation of Relation Classification via Convolutional Deep Neural Network.☆19Aug 24, 2021Updated 4 years ago
- Steering Vector Repo from "Extracting Latent Steering Vectors from Pretrained Language Models" - ACL2022 Findings☆11Mar 14, 2022Updated 4 years ago
- ☆10May 31, 2021Updated 5 years ago
- ☆32Aug 11, 2025Updated 10 months ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Mar 20, 2024Updated 2 years ago
- ☆16Feb 8, 2024Updated 2 years ago
- A four-day course on Python, the Scientific Python stack and PySpark, adapted from a training course I gave to one of our clients in Dece…☆10Feb 3, 2016Updated 10 years ago
- Official implementation of WildFX Dataset Generating pipeline.☆18Oct 21, 2025Updated 7 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Demo code contrasting Google Dataflow (Apache Beam) with Apache Spark☆14Sep 1, 2016Updated 9 years ago
- ☆10Mar 13, 2023Updated 3 years ago
- [ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".☆65Mar 5, 2026Updated 3 months ago
- Toda la geografía de México: Estados, municipios, sectores electorales, etc☆11Jun 15, 2017Updated 8 years ago
- [ICML 2025] EffiCoder: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning☆16May 24, 2025Updated last year
- ☆10Feb 2, 2023Updated 3 years ago
- ☆10Nov 17, 2022Updated 3 years ago