saparina / ambrosia
AMBROSIA: A Benchmark for Parsing Ambiguous Questions into Database Queries
★13 · Updated last year
Alternatives and similar repositories for ambrosia
Users who are interested in ambrosia are comparing it to the repositories listed below.
- ★122 · Updated 2 weeks ago
- The source code for the schema filter (question + schema only) ★49 · Updated last year
- Code for the paper "Understanding the Effects of Noise in Text-to-SQL: An Examination of the BIRD-Bench Benchmark". ★18 · Updated last year
- [ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation". ★258 · Updated last year
- This repository contains all the code for the DTS-SQL paper ★54 · Updated last year
- ★92 · Updated last year
- Comprehensive benchmark for RAG ★239 · Updated 5 months ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models" ★211 · Updated 11 months ago
- The source code of CodeS (SIGMOD 2024). ★193 · Updated last year
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models. ★526 · Updated last year
- ★52 · Updated last year
- UNITE: A Unified Benchmark for Text-to-SQL Evaluation ★82 · Updated 5 months ago
- Repoformer: Selective Retrieval for Repository-Level Code Completion (ICML 2024) ★61 · Updated 5 months ago
- Detect-Then-Explain Framework for Text-to-SQL task ★10 · Updated last year
- Semantic Evaluation for Text-to-SQL with Distilled Test Suites ★305 · Updated last year
- Resources on Large Language Models for Table Processing ★110 · Updated last year
- 🌲 Code for our EMNLP 2023 paper - "Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Mode… ★52 · Updated last year
- 🦫 BEAVER: An Enterprise Benchmark for Text-to-SQL ★23 · Updated 6 months ago
- Code for paper Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding ★86 · Updated last year
- ★52 · Updated last week
- ★189 · Updated 4 months ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't… ★125 · Updated last year
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Ziha… ★133 · Updated last year
- This repo contains code for paper: "Uncertainty Estimation and Quantification for LLMs: A Simple Supervised Approach". ★22 · Updated last year
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels … ★282 · Updated 2 years ago
- [EMNLP 2023] Poisoning Retrieval Corpora by Injecting Adversarial Passages https://arxiv.org/abs/2310.19156 ★41 · Updated last year
- The prediction results of ChatGPT on various datasets of Text-to-SQL. ★102 · Updated 2 years ago
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic… ★403 · Updated 7 months ago
- The code base for paper: "ReAcTable: Enhancing ReAct for Table Question Answering" ★34 · Updated last year
- Baselines for all tasks from Long Code Arena benchmarks 🏟️ ★36 · Updated 7 months ago