bird-bench / BIRD-CRITIC-1
BIRD-CRITIC 1.0: Can Large Language Models Solve USER SQL Issues in Real-World Database Applications?
☆571Updated 3 weeks ago
Alternatives and similar repositories for BIRD-CRITIC-1
Users that are interested in BIRD-CRITIC-1 are comparing it to the libraries listed below
- ☆551Updated 2 months ago
- ☆135Updated last month
- ☆514Updated 3 months ago
- ☆174Updated 4 months ago
- RAS: Retrieval-And-Structuring for Knowledge-Intensive LLM Generation☆48Updated 2 weeks ago
- A clean and extensible agentic RAG system with modular implementation.☆98Updated last month
- One-stop data intelligence agent, providing insights from all mainstream data formats in a single dialogue box, including documents, data…☆524Updated 6 months ago
- We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM…☆308Updated 4 months ago
- SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automated Survey Writing☆143Updated 2 months ago
- [ACL 2024] Knowledge Fusion by Evolving Weights of Language Models☆37Updated 8 months ago
- ☆505Updated last month
- Official Repository for Paper: The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning☆49Updated last month
- Open-Tax is an AI-powered cloud platform transforming tax compliance through automated data integration, real-time anomaly detection, and…☆403Updated 3 months ago
- on a mission to fund every open source project. Tokenize and trade GitHub repos☆74Updated last week
- Code Efficiency Benchmark☆78Updated 3 weeks ago
- A multimodal personal assistant that allows Large Language Models (LLMs) to run code locally, acting as an autonomous agent capable of co…☆205Updated 4 months ago
- ☆409Updated last month
- A Speech-to-Text Input Method For Windows☆474Updated last week
- [ACL2024 Findings] Knowledge-to-SQL: Enhancing SQL Generation with Data Expert LLM☆57Updated 2 months ago
- ☆533Updated 4 months ago
- ☆1,379Updated 7 months ago
- ☆195Updated this week
- LLM-FuzzX is a user-friendly fuzz testing tool for Large Language Models (e.g., GPT, Claude, LLaMA), featuring advanced task-aware mutati…☆112Updated 3 weeks ago
- ☆603Updated last year
- The codes for a paper☆14Updated 2 months ago
- Framework that enables fine-tuning of vision-language grounding models on custom datasets☆602Updated last month
- Calendar software rewrite☆453Updated 2 months ago
- Vexa is a decentralized AI agent platform built on BNB Chain.☆351Updated last month
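- A full-featured server-side AI implementation for Feishu chatbots: with a single container, operate your own Manus from inside the Feishu chat window.☆499Updated last week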
- When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification☆197Updated 2 weeks ago