peterbaile / beaverLinks
𦫠BEAVER: An Enterprise Benchmark for Text-to-SQL
β19Updated 2 months ago
Alternatives and similar repositories for beaver
Users that are interested in beaver are comparing it to the libraries listed below
Sorting:
- β96Updated 2 weeks ago
- Benchmarking library for RAGβ219Updated 3 weeks ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"β192Updated 8 months ago
- β284Updated last year
- Introduction page of a challenging text-to-SQL dataset: KaggleDBQAβ38Updated last year
- Code for the paper "Understanding the Effects of Noise in Text-to-SQL: An Examination of the BIRD-Bench Benchmark".β18Updated last year
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrievalβ153Updated 2 months ago
- Semantic Evaluation for Text-to-SQL with Distilled Test Suitesβ288Updated last year
- Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23β225Updated last year
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomicβ¦β366Updated 3 months ago
- Please visit https://github.com/HKUSTDial/NL2SQL360 to get the official code!β10Updated 11 months ago
- β184Updated last month
- Document Ranking with Large Language Models.β178Updated 2 months ago
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627β492Updated 10 months ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".β188Updated last month
- This repository contains all the code for the DTS-SQL paperβ53Updated last year
- π² Code for our EMNLP 2023 paper - π "Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Modeβ¦β51Updated last year
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervisionβ93Updated 9 months ago
- Comprehensive benchmark for RAGβ204Updated last month
- ACL2023 - AlignScore, a metric for factual consistency evaluation.β136Updated last year
- [NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".β130Updated last year
- RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation.β137Updated 2 months ago
- Code for paper "G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment"β366Updated last year
- RARR: Researching and Revising What Language Models Say, Using Language Modelsβ48Updated 2 years ago
- Repository for MuSiQue: Multi-hop Questions via Single-hop Question Composition, TACL 2022β154Updated last year
- Awesome LLM for NLG Evaluation Papersβ24Updated last year
- Companion code for FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models (ACL 2024)β54Updated last month
- ICLR 2022 Paper, SOTA Table Pre-training Model, TAPEX: Table Pre-training via Learning a Neural SQL Executorβ296Updated 2 years ago
- The prediction results of ChatGPT on various datasets of Text-to-SQL.β102Updated 2 years ago
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Modelsβ550Updated last year