webis-de / webis-tldr-17-corpus
Code for constructing TLDR corpus from Reddit dataset
☆24Updated 2 years ago
Related projects: ⓘ
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆69Updated 6 months ago
- The NewSHead dataset is a multi-doc headline dataset used in NHNet for training a headline summarization model.☆35Updated 2 years ago
- ☆86Updated 2 years ago
- ☆75Updated 9 months ago
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval☆27Updated last year
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆43Updated 4 months ago
- ☆97Updated 2 years ago
- A python tool for building large scale Wikipedia-based Information Retrieval datasets☆44Updated 3 years ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16Updated 2 years ago
- Code for Stage-wise Fine-tuning for Graph-to-Text Generation☆26Updated last year
- The codebase for our ACL2023 paper: Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learni…☆26Updated last year
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆74Updated 9 months ago
- Code for equipping pretrained language models (BART, GPT-2, XLNet) with commonsense knowledge for generating implicit knowledge statement…☆16Updated 3 years ago
- Neural models of common sense. 🤖☆91Updated 11 months ago
- For experiments involving instruct gpt. Currently used for documenting open research questions.☆71Updated last year
- WinoGrande: An Adversarial Winograd Schema Challenge at Scale☆87Updated 4 years ago
- This is the code for loading the SenseBERT model, described in our paper from ACL 2020.☆42Updated last year
- Do Multilingual Language Models Think Better in English?☆41Updated last year
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an …☆29Updated last year
- A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.☆68Updated last month
- Apps built using Inspired Cognition's Critique.☆58Updated last year
- ☆27Updated last month
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated last year
- Official implementation of the paper "IteraTeR: Understanding Iterative Revision from Human-Written Text" (ACL 2022)☆76Updated 10 months ago
- An open source toolkit for multimodal generative conversational task assistants, helping assist people with real-world complex tasks☆35Updated 3 months ago
- The pipeline for the OSCAR corpus☆161Updated 9 months ago
- This repository contains all the code for collecting large scale amounts of code from GitHub.☆105Updated last year
- Open source library for few shot NLP☆78Updated last year
- Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21)☆40Updated 3 years ago
- ☆178Updated last year