spapicchio / QATCH
Official implementation of QATCH: Benchmarking SQL-centric tasks with Table Representation Learning Models on Your Data
☆30 Updated 3 weeks ago
Alternatives and similar repositories for QATCH
Users that are interested in QATCH are comparing it to the libraries listed below
- Interpretability for sequence generation models ☆432 Updated 3 months ago
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models ☆550 Updated last year
- ☆367 Updated last year
- Train Llama 2 & 3 on the SQuAD v2 task as an example of how to specialize a generalized (foundation) model. ☆52 Updated last year
- Code for paper "G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment" ☆366 Updated last year
- ☆184 Updated last month
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: … ☆337 Updated 2 years ago
- Multilingual Large Language Models Evaluation Benchmark ☆128 Updated 11 months ago
- ACL2023 - AlignScore, a metric for factual consistency evaluation. ☆136 Updated last year
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models. ☆498 Updated last year
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic… ☆366 Updated 3 months ago
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627 ☆492 Updated 10 months ago
- The implementation of "RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question" [ACL 2023] ☆16 Updated last year
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al. ☆163 Updated last year
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation ☆206 Updated last year
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning" ☆456 Updated last year
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023 ☆70 Updated last year
- Scalable training for dense retrieval models. ☆299 Updated 2 months ago
- List of papers on hallucination detection in LLMs. ☆931 Updated last month
- Source Code of Paper "GPTScore: Evaluate as You Desire" ☆254 Updated 2 years ago
- Calculate perplexity on a text with pre-trained language models. Support MLM (eg. DeBERTa), recurrent LM (eg. GPT3), and encoder-decoder … ☆162 Updated last month
- The prime repository for state-of-the-art Multilingual Question Answering research and development. ☆736 Updated 7 months ago
- Inquisitive Parrots for Search ☆194 Updated 2 months ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models" ☆192 Updated 8 months ago
- [ICLR 2024 & NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically d… ☆303 Updated last year
- Code repository for supporting the paper "Atlas Few-shot Learning with Retrieval Augmented Language Models",(https//arxiv.org/abs/2208.03… ☆545 Updated last year
- ☆321 Updated this week
- Benchmarking library for RAG ☆219 Updated 3 weeks ago
- ☆56 Updated 2 months ago
- Fusion-in-Decoder ☆581 Updated last year