saparina / ambrosiaLinks
πΈππΉβππππΈ: A Benchmark for Parsing Ambiguous Questions into Database Queries
β14Updated last year
Alternatives and similar repositories for ambrosia
Users that are interested in ambrosia are comparing it to the libraries listed below
Sorting:
- Detect-Then-Explain Framework for Text-to-SQL taskβ10Updated 2 years ago
- β134Updated 2 months ago
- Code for the paper "Understanding the Effects of Noise in Text-to-SQL: An Examination of the BIRD-Bench Benchmark".β18Updated last year
- Code for paper Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understandingβ88Updated last year
- 𦫠BEAVER: An Enterprise Benchmark for Text-to-SQLβ25Updated 7 months ago
- The source code for the schema filter (question + schema only)β47Updated last year
- This repository contains all the code for the DTS-SQL paperβ53Updated last year
- [ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".β264Updated last year
- Comprehensive benchmark for RAGβ256Updated 7 months ago
- π² Code for our EMNLP 2023 paper - π "Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Modeβ¦β52Updated 2 years ago
- code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"β142Updated last year
- β57Updated last year
- Resources on Large Language Models for Table Processingβ110Updated last year
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don'tβ¦β126Updated last year
- Baselines for all tasks from Long Code Arena benchmarks ποΈβ39Updated 9 months ago
- A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources.β314Updated 3 months ago
- A Survey on Data Selection for Language Modelsβ254Updated 8 months ago
- [NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".β132Updated last year
- This repo contains code for paper: "Uncertainty Estimation and Quantification for LLMs: A Simple Supervised Approach".β24Updated last year
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refβ¦β69Updated 10 months ago
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Zihaβ¦β132Updated last year
- Characterization of relational table embeddings (VLDB 2024).β32Updated last year
- Code and data for "MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models"β50Updated 2 months ago
- β25Updated 7 months ago
- Semantic Evaluation for Text-to-SQL with Distilled Test Suitesβ313Updated last year
- The source code of CodeS (SIGMOD 2024).β194Updated last year
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"β217Updated last year
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)β127Updated last year
- CodeRAG-Bench: Can Retrieval Augment Code Generation?β164Updated last year
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasetsβ226Updated last year