archerfish-bench / benchmark
Benchmark framework for running text-to-SQL
☆49 Updated 2 months ago
Alternatives and similar repositories for benchmark:
Users who are interested in benchmark are comparing it to the repositories listed below:
- UNITE: A Unified Benchmark for Text-to-SQL Evaluation ☆70 Updated 10 months ago
- A System for (Optimized) Semantic Computation ☆92 Updated this week
- Framework for building data agent workflows ☆83 Updated 7 months ago
- State-of-the-art search over vector embeddings and structured data (SIGMOD '24) ☆70 Updated 3 weeks ago
- Evaluate the accuracy of LLM-generated outputs ☆644 Updated last month
- 🦫 BEAVER: An Enterprise Benchmark for Text-to-SQL ☆14 Updated last month
- ☆357 Updated last year
- ☆64 Updated 5 months ago
- Numbers Station Text to SQL model code. ☆246 Updated last year
- ☆42 Updated 4 months ago
- TAG-Bench: A benchmark for table-augmented generation (TAG) ☆712 Updated last week
- This project provides a demo for text-to-SQL based on CodeS. ☆52 Updated 9 months ago
- FlockMTL: DuckDB extension to seamlessly combine analytics and semantic analysis using language models (LMs) ☆108 Updated this week
- Semantic Evaluation for Text-to-SQL with Distilled Test Suites ☆264 Updated 9 months ago
- [ICLR 2023] Code for the paper "Binding Language Models in Symbolic Languages" ☆312 Updated last year
- CodexDB generates code for SQL query processing via OpenAI's GPT-3 Codex model. ☆101 Updated 4 months ago
- Code for extracting, parsing and annotating tables from GitTables (https://gittables.github.io). ☆43 Updated 3 years ago
- An efficient and effective few-shot NL2SQL method on GPT-4. ☆511 Updated 3 weeks ago
- Code and data for the paper "DBCᴏᴘɪʟᴏᴛ: Natural Language Querying over Massive Database via Schema Routing" (EDBT 2025) ☆88 Updated 2 weeks ago
- Framework for benchmarking vector search engines ☆316 Updated this week
- ☆143 Updated last week
- Efficiently tune any LLM from HuggingFace using distributed training (multiple GPUs) and DeepSpeed. Uses Ray AIR to orchestrate the … ☆56 Updated last year
- ☆133 Updated 2 months ago
- Pixeltable: AI data infrastructure providing a declarative, incremental approach for multimodal workloads. ☆166 Updated this week
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp… ☆173 Updated 7 months ago
- Playground for integrating large language models into the Modern Data Stack for entity matching ☆107 Updated 2 years ago
- [ICLR 2025 Oral] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows ☆379 Updated this week
- Foundation Models for Data Tasks ☆105 Updated last year
- OpenTelemetry Instrumentation for AI Observability ☆360 Updated this week
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking. ☆419 Updated this week