epfl-dlab / quootstrapLinks
Unsupervised method for extracting quotation-speaker pairs from large news corpora.
☆29Updated 6 years ago
Alternatives and similar repositories for quootstrap
Users that are interested in quootstrap are comparing it to the libraries listed below
Sorting:
- A natural language processing tool for automatically detecting quotations in text.☆15Updated 3 years ago
- Corpus of Attribution-Annotated news articles covering the campaigns during the year leading up to the 2016 US Presidential election.☆20Updated 6 years ago
- linguistic converter / merging tool for multi-level annotated corpora. graph-based (using Python and NetworkX).☆51Updated 2 years ago
- Repository for rstWeb, a browser based annotation interface for Rhetorical Structure Theory☆43Updated 7 months ago
- Corpus and annotations for the CL-Aff Shared Task from the University of Pennsylvania☆19Updated 3 years ago
- Sentence specificity prediction☆25Updated 6 years ago
- Code and data for ACL2016 article "Which argument is more convincing? Analyzing and predicting convincingness of Web arguments using bidi…☆28Updated 8 years ago
- Cross-lingual metaphor detection.☆66Updated 6 years ago
- Metaphor classification for verbs and content words☆65Updated last year
- A set of media framing annotations, along with scripts for obtaining the corresponding news articles☆52Updated 5 years ago
- Multi-Annotator Competence Estimation tool☆63Updated 5 years ago
- Harassment Lexicon and Corpus☆30Updated 7 years ago
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)☆66Updated this week
- Twpipe is a pipeline toolkit that parses raw tweets into universal dependencies.☆28Updated 6 years ago
- Mining Discourse Markers for Unsupervised Sentence Representation Learning☆60Updated 2 years ago
- Linguistic and stylistic complexity measures for (literary) texts☆81Updated last year
- ☆64Updated 2 years ago
- Predict edit intentions on Wikipedia☆19Updated 6 years ago
- MAGPIE: A sense-annotated corpus of potentially idiomatic expressions☆27Updated 4 years ago
- Disambiguate is a tool for training and using state of the art neural WSD models☆60Updated 3 years ago
- Metaphor dataset: literal versus non-literal uses of words☆14Updated 9 years ago
- Distribution of word meanings in Wikipedia for English, Italian, French, German and Spanish.☆10Updated 4 years ago
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016…☆68Updated 3 years ago
- Message Understanding Conference 3 Corpus☆10Updated 4 years ago
- ☆54Updated 3 years ago
- Code and data for the WSDM '19 paper "Crosslingual Document Embedding as Reduced-Rank Ridge Regression (Cr5)"☆30Updated 5 years ago
- Processing the MPQA Corpus☆27Updated 6 years ago
- Diagnostic tests for linguistic capacities in language models☆66Updated 3 years ago
- ☆83Updated 4 years ago
- data and scripts for the shared task "Task 1: Paraphrase and Semantic Similarity in Twitter (PIT)" at SemEval 2015☆43Updated 4 years ago