xlang-ai / Spider2
[ICLR 2025] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
☆297Updated this week
Alternatives and similar repositories for Spider2:
Users that are interested in Spider2 are comparing it to the libraries listed below
- Contextual Harnessing for Efficient SQL Synthesis☆162Updated 2 months ago
- A MULTI-GENERATOR ENSEMBLE FRAMEWORK FOR NATURAL LANGUAGE TO SQL☆337Updated last week
- [NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?☆116Updated 5 months ago
- "GraphAgent: Agentic Graph Language Assistant"☆239Updated 3 weeks ago
- The official repo for paper, LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods.☆243Updated last month
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation☆283Updated 3 months ago
- "MiniRAG: Making RAG Simpler with Small and Free Language Models"☆518Updated this week
- MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL☆219Updated 7 months ago
- AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning (NeurIPS 2024)☆172Updated 2 weeks ago
- This is a continuously updated handbook for readers to easily track the latest NL2SQL (Text-to-SQL) techniques in the literature and prov…☆338Updated this week
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆213Updated 5 months ago
- STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases (NeurIPS D&B 2024)☆297Updated 3 weeks ago
- Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation".☆206Updated 8 months ago
- The official implementation of Self-Play Preference Optimization (SPPO)☆471Updated last week
- ☆95Updated 9 months ago
- ☆281Updated this week
- ☆346Updated 10 months ago
- The source code of CodeS (SIGMOD 2024).☆153Updated 2 months ago
- A curated list of resources on graph-based retrieval-augmented generation (GraphRAG) for customized large language models.☆300Updated this week
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆209Updated 3 months ago
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆121Updated last month
- [ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.☆158Updated 2 months ago
- Search-o1: Agentic Search-Enhanced Large Reasoning Models☆515Updated this week
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24☆132Updated 7 months ago
- Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception☆107Updated last week
- AWM: Agent Workflow Memory☆233Updated 2 months ago
- Repository for "MultiHop-RAG: A Dataset for Evaluating Retrieval-Augmented Generation Across Documents" (COLM 2024)☆249Updated 2 months ago
- Official repository for the paper “The Dawn of Natural Language to SQL: Are We Fully Ready?” (VLDB'24)☆88Updated 4 months ago
- This is the repository for the Tool Learning survey.☆294Updated 2 months ago
- Official Repository of ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning☆207Updated 4 months ago