spapicchio / QATCHLinks
Official implementation of QATCH: Benchmarking SQL-centric tasks with Table Representation Learning Models on Your Data
β32Updated 6 months ago
Alternatives and similar repositories for QATCH
Users that are interested in QATCH are comparing it to the libraries listed below
Sorting:
- Interpretability for sequence generation models π πβ453Updated this week
- Train Llama 2 & 3 on the SQuAD v2 task as an example of how to specialize a generalized (foundation) model.β53Updated last year
- β187Updated 7 months ago
- Multilingual Large Language Models Evaluation Benchmarkβ133Updated last year
- β432Updated this week
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.β552Updated last year
- ACL2023 - AlignScore, a metric for factual consistency evaluation.β150Updated last year
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627β510Updated last year
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Modelsβ598Updated last year
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomicβ¦β415Updated 9 months ago
- A collection of large question answering datasetsβ429Updated last year
- Train and Infer Powerful Sentence Embeddings with AnglE | π₯ SOTA on STS and MTEB Leaderboardβ567Updated 3 months ago
- Code for paper "G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment"β407Updated 2 years ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: β¦β340Updated 2 years ago
- All-in-one text de-duplicationβ741Updated last month
- Calculate perplexity on a text with pre-trained language models. Support MLM (eg. DeBERTa), recurrent LM (eg. GPT3), and encoder-decoder β¦β165Updated 7 months ago
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"β457Updated 2 years ago
- The code for HerO: a fact-checking pipeline based on open LLMs (the runner-up in AVeriTeC)β12Updated 10 months ago
- β373Updated 2 years ago
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generationβ213Updated last year
- SemEval2024-task8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detectionβ80Updated last year
- The implementation of "RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question" [ACL 2023]β16Updated last year
- List of papers on hallucination detection in LLMs.β1,041Updated 3 weeks ago
- Long Document Summarization Papersβ154Updated 2 years ago
- This repo contains the code for generating the ToxiGen dataset, published at ACL 2022.β345Updated last year
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"β223Updated last year
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.β165Updated 2 years ago
- Code repository for supporting the paper "Atlas Few-shot Learning with Retrieval Augmented Language Models",(https//arxiv.org/abs/2208.03β¦β553Updated 2 years ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023β72Updated last year
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasetsβ227Updated last year