bird-bench / BIRD-CRITIC-1Links
[NeurIPS 2025 Main] SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications
☆775Updated 2 months ago
Alternatives and similar repositories for BIRD-CRITIC-1
Users that are interested in BIRD-CRITIC-1 are comparing it to the libraries listed below
Sorting:
- [TKDE2025] Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL | A curated list of resources (surveys, papers, benchma…☆831Updated this week
- Science-Star: A Platform for Building, Extending, and Experimenting with Scientific Agents.☆737Updated last month
- ☆209Updated 2 weeks ago
- ☆356Updated 5 months ago
- Repo-level benchmark for real-world Code Agents: from repo understanding → env setup → incremental dev/bug-fixing → task delivery, with c…☆238Updated 2 months ago
- 🧠 Prometheus: A Knowledge-Graph-Driven 🤖 AI Agent that maps 🗺, understands 🧩, and repairs 🛠 complex codebases — not by guessing, but…☆430Updated last week
- Auto-Manage Your Personal Task Context with AI.☆1,281Updated this week
- ☆515Updated 9 months ago
- [BIRD-INTERACT] Re-imagines Text-to-SQL evaluation via lens of dynamic interactions.☆451Updated 2 weeks ago
- RAS: Retrieval-And-Structuring for Knowledge-Intensive LLM Generation☆57Updated 2 weeks ago
- ☆952Updated 4 months ago
- 智川x-agent☆1,081Updated 3 months ago
- We introduce the Audio Logical Reasoning (ALR) dataset, consisting of 6,446 text-audio annotated samples specifically designed for comple…