Priya22 / pdnc-lrec2022
Repo for the LREC 2022 paper The Project Dialogism Novel Corpus: A Dataset for Quotation Attribution in Literary Texts.
☆13Updated 2 years ago
Alternatives and similar repositories for pdnc-lrec2022
Users that are interested in pdnc-lrec2022 are comparing it to the libraries listed below
Sorting:
- GC4LM: A Colossal (Biased) language model for German☆13Updated 4 years ago
- A python module for evaluating NERC and NEL system performances as defined in the HIPE shared tasks (formerly CLEF-HIPE-2020-scorer).☆14Updated 11 months ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Updated 3 years ago
- BERT models for many languages created from Wikipedia texts☆33Updated 4 years ago
- ☆17Updated last year
- An implementation of GrASP (Shnarch et. al., 2017)☆21Updated 2 years ago
- Bayesian Assessment of Hypotheses☆24Updated last year
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.☆19Updated 2 years ago
- Download and load spaCy models on-the-fly☆15Updated 2 years ago
- ☆17Updated 2 years ago
- Data for the HIPE 2022 shared task.☆18Updated last year
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆80Updated 10 months ago
- These are lists for a variety of languages containing words that are distinctive to each language.☆38Updated 3 years ago
- The Universal Decompositional Semantics (UDS) dataset and the Decomp toolkit☆57Updated last year
- BERT and ELECTRA models trained on Europeana Newspapers☆38Updated 3 years ago
- ☆17Updated 2 years ago
- The official repository for the The Project Dialogism Novel Corpus, a dataset of annotated quotations in full-length English novels.☆39Updated last year
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…☆22Updated 9 months ago
- Tool for parsing and converting various span encoding schemes.☆23Updated last year
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them☆37Updated 3 years ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.☆52Updated last year
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"☆28Updated 5 years ago
- MultiCite code and data. Models are available on Huggingface.☆31Updated 3 years ago
- MAGPIE: A sense-annotated corpus of potentially idiomatic expressions☆27Updated 4 years ago
- ☆64Updated 2 years ago
- ☆13Updated 3 years ago
- Statistics on multilingual datasets☆17Updated 2 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆27Updated 3 years ago
- Featurize words into orthographic and phonological vectors.☆41Updated last year
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆18Updated 9 months ago