spapicchio / QATCHLinks
Official implementation of QATCH: Benchmarking SQL-centric tasks with Table Representation Learning Models on Your Data
β30Updated last month
Alternatives and similar repositories for QATCH
Users that are interested in QATCH are comparing it to the libraries listed below
Sorting:
- Interpretability for sequence generation models π πβ437Updated 4 months ago
- Multilingual Large Language Models Evaluation Benchmarkβ130Updated last year
- β185Updated 2 months ago
- The implementation of "RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question" [ACL 2023]β16Updated last year
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Modelsβ557Updated last year
- ACL2023 - AlignScore, a metric for factual consistency evaluation.β137Updated last year
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627β494Updated 10 months ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.β163Updated last year
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generationβ209Updated last year
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomicβ¦β376Updated 4 months ago
- Code repository for supporting the paper "Atlas Few-shot Learning with Retrieval Augmented Language Models",(https//arxiv.org/abs/2208.03β¦β547Updated last year
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.β499Updated last year
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"β456Updated last year
- Train and Infer Powerful Sentence Embeddings with AnglE | π₯ SOTA on STS and MTEB Leaderboardβ552Updated 5 months ago
- β367Updated last year
- Code for paper "G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment"β373Updated last year
- 𦫠BEAVER: An Enterprise Benchmark for Text-to-SQLβ19Updated 3 months ago
- Long Document Summarization Papersβ149Updated 2 years ago
- [NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".β130Updated last year
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"β196Updated 9 months ago
- The code for HerO: a fact-checking pipeline based on open LLMs (the runner-up in AVeriTeC)β10Updated 5 months ago
- OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.β570Updated last year
- Source Code of Paper "GPTScore: Evaluate as You Desire"