bird-bench / BIRD-CRITIC-1
BIRD-CRITIC 1.0: Can Large Language Models Solve USER SQL Issues in Real-World Database Applications?
☆574 Updated last week
Alternatives and similar repositories for BIRD-CRITIC-1
Users who are interested in BIRD-CRITIC-1 are comparing it to the repositories listed below.
- ☆204 Updated 2 months ago
- ☆872 Updated 2 months ago
- R-KV: Redundancy-aware KV Cache Compression for Reasoning Models ☆392 Updated last week
- ☆515 Updated 3 months ago
- [ACL 2024] Knowledge Fusion by Evolving Weights of Language Models ☆37 Updated 9 months ago
- We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM… ☆309 Updated 4 months ago
- Code Efficiency Benchmark ☆78 Updated last month
- RAS: Retrieval-And-Structuring for Knowledge-Intensive LLM Generation ☆49 Updated last month
- ☆1,378 Updated 8 months ago
- ☆174 Updated 4 months ago
- A clean and extensible agentic RAG system with modular implementation. ☆103 Updated last month
- Tokenize The Virtual Agents Onchain ☆240 Updated 2 weeks ago
- AI-powered tool for efficient abstract and PDF screening in systematic reviews. ☆337 Updated 3 weeks ago
- ☆422 Updated 9 months ago
- A timestamp for Code LLMs ☆72 Updated 3 weeks ago
- ☆533 Updated 4 months ago
- Welcome to BlockSeek's official documentation. BlockSeek combines state-of-the-art AI with blockchain technology to revolutionize cryptoc… ☆310 Updated 4 months ago
- Official Repository for Paper: The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning ☆50 Updated 2 months ago
- [ACL2024 Findings] Knowledge-to-SQL: Enhancing SQL Generation with Data Expert LLM ☆57 Updated 3 months ago
- UpTop is a BNB Chain-based liquidity protocol that allows users to unilaterally add BNB to liquidity pools, earn high yields, and support… ☆75 Updated last week
- Framework that enables fine-tuning of vision-language grounding models on custom datasets ☆601 Updated 2 months ago
- ☆690 Updated 2 months ago
- 4th Place Solution for the Kaggle Competition: LMSYS - Chatbot Arena Human Preference Predictions ☆173 Updated 7 months ago
- Open-Tax is an AI-powered cloud platform transforming tax compliance through automated data integration, real-time anomaly detection, and… ☆404 Updated 4 months ago
- Res-SAM Framework for GPR Underground Hazard Detection ☆594 Updated 2 weeks ago
- LLM-FuzzX is a user-friendly fuzz testing tool for Large Language Models (e.g., GPT, Claude, LLaMA), featuring advanced task-aware mutati… ☆112 Updated last month
- Dataset approached by A Benchmark and Frequency Compression Method for Infrared Few-Shot Object Detection ☆1,008 Updated 2 months ago
- ☆603 Updated last year
- ☆409 Updated this week
- Calendar software rewrite ☆453 Updated 2 months ago