bird-bench / BIRD-CRITIC-1
BIRD-CRITIC 1.0: Can Large Language Models Solve USER SQL Issues in Real-World Database Applications?
☆395Updated last month
Alternatives and similar repositories for BIRD-CRITIC-1:
Users that are interested in BIRD-CRITIC-1 are comparing it to the libraries listed below
- ☆381Updated last month
- We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM…☆308Updated 2 months ago
- [ACL 2024] Knowledge Fusion by Evolving Weights of Language Models☆37Updated 6 months ago
- ☆81Updated last month
- A timestamp for Code LLMs☆72Updated 3 weeks ago
- Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMs☆152Updated 2 weeks ago
- ☆535Updated last month
- LLM-FuzzX is a user-friendly fuzz testing tool for Large Language Models (e.g., GPT, Claude, LLaMA), featuring advanced task-aware mutati…☆111Updated 2 months ago
- ☆209Updated last month
- Open-Tax is an AI-powered cloud platform transforming tax compliance through automated data integration, real-time anomaly detection, and…☆402Updated last month
- AIGC Creative Suite☆202Updated last month
- 日历软件重写☆102Updated last week
- PVPAI LLM 🔥The First Open-Source DeFAI Large Language Model Powered by DeepSeek.☆304Updated 2 months ago
- ☆231Updated last month
- H2HDB is a comprehensive database for organising and managing H@H comic collections.☆202Updated this week
- ☆601Updated last year
- ☆150Updated 2 months ago
- 4th Place Solution for the Kaggle Competition: LMSYS - Chatbot Arena Human Preference Predictions☆173Updated 5 months ago
- OmniAgent Framework is an advanced, modular AI orchestration system that transforms Web3 development by seamlessly integrating artificial…☆319Updated 2 months ago
- A multimodal personal assistant that allows Large Language Models (LLMs) to run code locally, acting as an autonomous agent capable of co…☆202Updated 2 months ago
- A public good tool to help users verify Safe (Gnosis Safe) transactions before signing or execution.☆527Updated 2 weeks ago
- AML end to end system☆838Updated 3 months ago
- Welcome to BlockSeek's official documentation. BlockSeek combines state-of-the-art AI with blockchain technology to revolutionize cryptoc…☆307Updated last month
- 一个基于Mybatis封装的类JdbcTemplate风格的ORM工具☆485Updated 2 weeks ago
- open-exp-plugin 是一个示例插件,旨在展示如何开发和扩展 ThingLinks 平台的功能。此插件提供了一个实验性功能扩展示例,帮助开发者深入了解如何利用 ThingLinks 的 API 和插件架构进行自定义开发和集成。☆115Updated this week
- ☆302Updated last week
- https://x.com/wmchain☆303Updated this week
- ☆1,381Updated 5 months ago
- awesome-aptos☆124Updated last year
- The codes for a paper☆13Updated 3 weeks ago