defog-ai / sql-eval
Evaluate the accuracy of LLM generated outputs
☆635Updated 2 weeks ago
Alternatives and similar repositories for sql-eval:
Users that are interested in sql-eval are comparing it to the libraries listed below
- A efficient and effective few-shot NL2SQL method on GPT-4.☆500Updated this week
- ☆308Updated last year
- ☆355Updated 11 months ago
- Numbers Station Text to SQL model code.☆244Updated last year
- Automated Evaluation of RAG Systems☆560Updated 4 months ago
- Semantic Evaluation for Text-to-SQL with Distilled Test Suites☆258Updated 9 months ago
- MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL☆231Updated 2 weeks ago
- Contextual Harnessing for Efficient SQL Synthesis☆181Updated 3 months ago
- ☆845Updated 4 months ago
- TAG-Bench: A benchmark for table-augmented generation (TAG)☆694Updated 3 weeks ago
- A MULTI-GENERATOR ENSEMBLE FRAMEWORK FOR NATURAL LANGUAGE TO SQL☆439Updated this week
- Open-source tool to visualise your RAG 🔮☆1,114Updated 2 months ago
- HyDE: Precise Zero-Shot Dense Retrieval without Relevance Labels☆518Updated 3 months ago
- A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.☆744Updated last week
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.☆287Updated 3 months ago
- Fine-Tuning Embedding for RAG with Synthetic Data☆488Updated last year
- UNITE: A Unified Benchmark for Text-to-SQL Evaluation☆67Updated 10 months ago
- ☆578Updated last month
- The source code of CodeS (SIGMOD 2024).☆160Updated 3 months ago
- ☆99Updated 11 months ago
- LOTUS: A semantic query engine for fast and easy LLM-powered data processing☆1,117Updated this week
- The code for the paper C3: Zero-shot Text-to-SQL with ChatGPT☆139Updated 7 months ago
- The Pytorch implementation of RESDSQL (AAAI 2023).☆253Updated 10 months ago
- RAGChecker: A Fine-grained Framework For Diagnosing RAG☆788Updated 3 months ago
- High-performance retrieval engine for unstructured data☆1,217Updated this week
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…☆313Updated last year
- Evaluate your LLM's response with Prometheus and GPT4 💯☆879Updated 2 months ago
- A tool for evaluating LLMs☆406Updated 10 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,327Updated 2 weeks ago
- The official repository of "ChatDB: Augmenting LLMs with Databases as Their Symbolic Memory".☆564Updated last year