spapicchio / QATCH
Official implementation of QATCH: Benchmarking SQL-centric tasks with Table Representation Learning Models on Your Data
☆27Updated last week
Alternatives and similar repositories for QATCH:
Users that are interested in QATCH are comparing it to the libraries listed below
- The implementation of "RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question" [ACL 2023]☆14Updated 10 months ago
- A python package for benchmarking interpretability techniques on Transformers.☆213Updated 5 months ago
- ☆174Updated 2 years ago
- Interpretability for sequence generation models 🐛 🔍☆405Updated 3 months ago
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation☆197Updated last year
- Source Code of Paper "GPTScore: Evaluate as You Desire"☆242Updated 2 years ago
- RARR: Researching and Revising What Language Models Say, Using Language Models☆45Updated last year
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.☆162Updated last year
- ☆45Updated 5 months ago
- Scalable training for dense retrieval models.☆276Updated this week
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"☆444Updated last year
- ☆125Updated last month
- A Survey of Attributions for Large Language Models☆195Updated 6 months ago
- Reverse Instructions to generate instruction tuning data with corpus examples☆208Updated 11 months ago
- Repository for research in the field of Responsible NLP at Meta.☆196Updated 3 months ago
- Benchmarking library for RAG☆169Updated this week
- Long Document Summarization Papers☆142Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆125Updated 11 months ago
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆209Updated 3 months ago
- ACL2023 - AlignScore, a metric for factual consistency evaluation.☆121Updated 11 months ago
- ☆11Updated 2 years ago
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"☆48Updated 2 years ago
- 🔍 A statutory article retrieval dataset in French. (ACL 2022)☆39Updated last year
- [ICLR 2024 & NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically d…☆297Updated last year
- A Survey on Data Selection for Language Models☆213Updated 4 months ago
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627☆475Updated 4 months ago
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…☆323Updated 9 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆175Updated last month
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.☆438Updated last year
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆83Updated 6 months ago