Priya22 / pdnc-lrec2022
Repo for the LREC 2022 paper The Project Dialogism Novel Corpus: A Dataset for Quotation Attribution in Literary Texts.
☆13Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for pdnc-lrec2022
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆46Updated 3 years ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.☆19Updated 2 years ago
- BERT models for many languages created from Wikipedia texts☆34Updated 4 years ago
- Download and load spaCy models on-the-fly☆14Updated last year
- These are lists for a variety of languages containing words that are distinctive to each language.☆34Updated 2 years ago
- ☆12Updated 2 years ago
- GC4LM: A Colossal (Biased) language model for German☆13Updated 3 years ago
- A flexible sentence segmentation library using CRF model and regex rules☆24Updated 8 months ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆76Updated 4 months ago
- 🐸 KERMIT - A lightweight library to encode and interpret Universal Syntactic Embeddings☆58Updated last year
- MAGPIE: A sense-annotated corpus of potentially idiomatic expressions☆25Updated 4 years ago
- ML Reproducibility Challenge 2020: Electra reimplementation using PyTorch and Transformers☆12Updated 3 years ago
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them☆37Updated 2 years ago
- ☆17Updated last year
- Converter from UD-trees to BART representation☆36Updated 8 months ago
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset"☆53Updated 2 years ago
- Calculate Krippendorff's Alpha on any DataFrame☆35Updated last year
- Searching in-memory corpus with Corpus Query Language (CQL)☆18Updated 3 years ago
- Generate BERT vocabularies and pretraining examples from Wikipedias☆18Updated 4 years ago
- The official repository for the The Project Dialogism Novel Corpus, a dataset of annotated quotations in full-length English novels.☆39Updated last year
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.☆26Updated 3 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆36Updated 2 years ago
- ☆27Updated 3 months ago
- GrammarTagger — A Neural Multilingual Grammar Profiler for Language Learning☆27Updated 3 years ago
- ☆38Updated 4 years ago
- Getting interpretable dimensions in word embedding spaces.☆14Updated last year
- ☆53Updated 10 months ago
- Package to extract connotation frames☆80Updated 11 months ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆26Updated 3 years ago
- Featurize words into orthographic and phonological vectors.☆40Updated last year