spapicchio / QATCH
Official implementation of QATCH: Benchmarking SQL-centric tasks with Table Representation Learning Models on Your Data
☆25Updated 3 weeks ago
Alternatives and similar repositories for QATCH:
Users that are interested in QATCH are comparing it to the libraries listed below
- ☆173Updated last year
- Multilingual Large Language Models Evaluation Benchmark☆115Updated 4 months ago
- A python package for benchmarking interpretability techniques on Transformers.☆212Updated 3 months ago
- Benchmarking library for RAG☆154Updated this week
- Inquisitive Parrots for Search☆183Updated 10 months ago
- Interpretability for sequence generation models 🐛 🔍☆394Updated 2 months ago
- A framework for few-shot evaluation of autoregressive language models.☆102Updated last year
- The implementation of "RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question" [ACL 2023]☆14Updated 9 months ago
- RARR: Researching and Revising What Language Models Say, Using Language Models☆43Updated last year
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"☆47Updated 2 years ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆132Updated last month
- ☆42Updated 4 months ago
- A Large-Scale Dataset for Long Text and Multi-Table Summarization☆16Updated 10 months ago
- Long Document Summarization Papers☆140Updated last year
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.☆162Updated last year
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆327Updated last year
- Source Code of Paper "GPTScore: Evaluate as You Desire"☆238Updated last year
- ☆349Updated last year
- Use Large Language Models like OpenAI's GPT-3.5 for data annotation and model enhancement. This framework combines human expertise with L…☆32Updated last year
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation☆195Updated 11 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆141Updated last year
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"☆99Updated 2 years ago
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆200Updated 2 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆124Updated 10 months ago
- Retrieval-Augmented Generation battle!☆48Updated last month
- ACL2023 - AlignScore, a metric for factual consistency evaluation.☆119Updated 10 months ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆92Updated last year
- Train Llama 2 & 3 on the SQuAD v2 task as an example of how to specialize a generalized (foundation) model.☆50Updated 7 months ago
- ☆50Updated last month
- ☆65Updated last year