epfl-dlab / QuotebankLinks
Code and data for the WSDM '21 paper "Quotebank: A Corpus of Quotations from a Decade of News"
☆19Updated 4 years ago
Alternatives and similar repositories for Quotebank
Users that are interested in Quotebank are comparing it to the libraries listed below
Sorting:
- Linguistic and stylistic complexity measures for (literary) texts☆82Updated last year
- A module to compute textual lexical richness (aka lexical diversity).☆110Updated 2 years ago
- Dutch coreference resolution & dialogue analysis using deterministic rules☆21Updated 2 years ago
- A set of media framing annotations, along with scripts for obtaining the corresponding news articles☆52Updated 6 years ago
- A multilingual lexicon of words to hurt.☆90Updated last month
- This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs☆184Updated last year
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆90Updated last month
- Mining individual characters in multiparty dialogue☆172Updated 2 years ago
- Contextualised Word Representations for Lexical Semantic Change Analysis☆32Updated 5 years ago
- ☆53Updated last year
- Package to extract connotation frames☆87Updated last year
- Unsupervised method for extracting quotation-speaker pairs from large news corpora.☆29Updated 7 years ago
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities.☆362Updated 2 years ago
- Easier Automatic Sentence Simplification Evaluation☆161Updated last year
- Corpus of Attribution-Annotated news articles covering the campaigns during the year leading up to the 2016 US Presidential election.☆20Updated 7 years ago
- This repository contains all new resources that were created for the NAACL-2018 paper "Inducing a Lexicon of Abusive Words -- A Feature-B…☆29Updated 6 years ago
- A Python wrapper around the topic modeling functions of MALLET.☆103Updated 10 months ago
- Alignment and annotation for comparable documents.☆22Updated 6 years ago
- This is a simple Python package for calculating a variety of lexical diversity indices☆79Updated last year
- ☆64Updated 2 years ago
- ☆21Updated last year
- A tokenizer and sentence splitter for German and English web and social media texts.☆147Updated 8 months ago
- Scripts for large-scale prediction of lexical semantic change.☆12Updated 2 years ago
- potato: portable text annotation tool☆350Updated last month
- Diagnostic tests for linguistic capacities in language models☆66Updated 3 years ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆156Updated 2 years ago
- WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Wor…☆179Updated last month
- BERT and ELECTRA models trained on Europeana Newspapers☆38Updated 3 years ago
- coFR: COreference resolution tool for FRench (and singletons).☆25Updated 5 years ago
- GSRL is a seq2seq model for end-to-end dependency- and span-based SRL (IJCAI2021).☆18Updated 3 years ago