TREMA-UNH / rubric-grading-workbench
A Workbench for Autograding Retrieve/Generate Systems
☆14 · Updated 2 months ago
Alternatives and similar repositories for rubric-grading-workbench
Users who are interested in rubric-grading-workbench are comparing it to the repositories listed below:
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021 ☆29 · Updated last year
- Dense hybrid representations for text retrieval ☆62 · Updated last year
- Code and dataset for the EMNLP paper "Instruct and Extract: Instruction Tuning for On-Demand Information Extraction" ☆49 · Updated last year
- Submission archive for the MS MARCO passage ranking leaderboard ☆13 · Updated last year
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset. ☆43 · Updated last year
- This repository contains code and data for the EMNLP 2022 paper "CONDAQA: A Contrastive Reading Comprehension Dataset for Reasoning about… ☆10 · Updated 2 years ago
- One-stop shop for running and fine-tuning transformer-based language models for retrieval ☆44 · Updated this week
- Cross language information retrieval pipeline ☆18 · Updated last year
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation. ☆22 · Updated last year
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu… ☆40 · Updated 3 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https… ☆43 · Updated 5 months ago
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ. ☆22 · Updated last year
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval ☆40 · Updated 3 months ago
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022) ☆38 · Updated 2 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages. ☆73 · Updated 2 years ago
- Resources for the shared task on conversational question answering SCAI-QReCC 2021 ☆27 · Updated 2 years ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings ☆17 · Updated 3 months ago
- Code, data, and pretrained models for the paper "Generating Wikipedia Article Sections from Diverse Data Sources" ☆20 · Updated 3 years ago
- In-BoXBART: Get Instructions into Biomedical Multi-task Learning ☆14 · Updated 2 years ago
- Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022) ☆76 · Updated last year
- Retrieval-Augmented Generation battle! ☆48 · Updated last month
- A Python framework for conversational search ☆40 · Updated 3 years ago
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an … ☆31 · Updated last year