peterbaile / beaver
🦫 BEAVER: An Enterprise Benchmark for Text-to-SQL
☆18 · Updated last month
Alternatives and similar repositories for beaver
Users who are interested in beaver are comparing it to the repositories listed below.
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models" ☆191 · Updated 7 months ago
- Benchmarking library for RAG ☆213 · Updated last month
- Comprehensive benchmark for RAG ☆198 · Updated last month
- ☆91 · Updated 2 weeks ago
- Introduction page of a challenging text-to-SQL dataset: KaggleDBQA ☆38 · Updated last year
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval ☆150 · Updated last month
- Code for the paper "Understanding the Effects of Noise in Text-to-SQL: An Examination of the BIRD-Bench Benchmark". ☆19 · Updated last year
- ☆284 · Updated last year
- Document Ranking with Large Language Models. ☆169 · Updated last month
- Detect-Then-Explain Framework for Text-to-SQL task ☆11 · Updated last year
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks". ☆176 · Updated 3 weeks ago
- The prediction results of ChatGPT on various datasets of Text-to-SQL. ☆102 · Updated 2 years ago
- ☆182 · Updated 2 weeks ago
- [NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables". ☆130 · Updated last year
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision ☆90 · Updated 8 months ago
- Use contrastive learning to train a large language model (LLM) as a retriever ☆11 · Updated 11 months ago
- Semantic Evaluation for Text-to-SQL with Distilled Test Suites ☆283 · Updated last year
- Repository for MuSiQue: Multi-hop Questions via Single-hop Question Composition, TACL 2022 ☆150 · Updated last year
- The implementation of "RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question" [ACL 2023] ☆16 · Updated last year
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning ☆244 · Updated last year
- Awesome LLM for NLG Evaluation Papers ☆24 · Updated last year
- RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation. ☆134 · Updated 2 months ago
- Data for paper "Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness" ☆31 · Updated 2 years ago
- Please visit https://github.com/HKUSTDial/NL2SQL360 to get the official code! ☆10 · Updated 10 months ago
- UNITE: A Unified Benchmark for Text-to-SQL Evaluation ☆78 · Updated last month
- ☆16 · Updated last year
- 🌲 Code for our EMNLP 2023 paper - "Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Mode… ☆50 · Updated last year
- [SUKI'22] Table Retrieval May Not Necessitate Table-Specific Model Design ☆22 · Updated 2 years ago
- Contextual Harnessing for Efficient SQL Synthesis ☆224 · Updated last month
- A comprehensive paper list of Reasoning over Tables. ☆28 · Updated 2 years ago