spapicchio / QATCH
Official implementation of QATCH: Benchmarking SQL-centric tasks with Table Representation Learning Models on Your Data
β25Updated last month
Related projects β
Alternatives and complementary repositories for QATCH
- Interpretability for sequence generation models π πβ377Updated last week
- A python package for benchmarking interpretability techniques on Transformers.β212Updated last month
- β167Updated last year
- A Survey on Data Selection for Language Modelsβ182Updated last month
- β333Updated 11 months ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.β160Updated last year
- β211Updated 8 months ago
- [NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".β115Updated 6 months ago
- β32Updated 2 months ago
- Source Code of Paper "GPTScore: Evaluate as You Desire"β231Updated last year
- Calculate perplexity on a text with pre-trained language models. Support MLM (eg. DeBERTa), recurrent LM (eg. GPT3), and encoder-decoder β¦β136Updated last month
- Benchmarking library for RAGβ123Updated this week
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generationβ193Updated 9 months ago
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.β410Updated 9 months ago
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627β461Updated last month
- π A statutory article retrieval dataset in French. (ACL 2022)β38Updated last year
- A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuningβ142Updated 8 months ago
- β56Updated 2 years ago
- Repository for research in the field of Responsible NLP at Meta.β186Updated last week
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: β¦β323Updated last year
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"β115Updated last month
- β265Updated 11 months ago
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.β141Updated last year
- A Survey of Attributions for Large Language Modelsβ168Updated 2 months ago
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomicβ¦β292Updated 6 months ago
- SpanMarker for Named Entity Recognitionβ401Updated 3 months ago
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"β431Updated last year
- A framework for few-shot evaluation of autoregressive language models.β101Updated last year
- ACL2023 - AlignScore, a metric for factual consistency evaluation.β111Updated 8 months ago
- β111Updated last year