dbamman / litbank
Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities.
☆343Updated 2 years ago
Alternatives and similar repositories for litbank:
Users that are interested in litbank are comparing it to the libraries listed below
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology…☆221Updated 2 years ago
- Tutorial on computational models of language change☆114Updated 5 years ago
- Implementation of the ClausIE information extraction system for python+spacy☆220Updated 2 years ago
- An elaborate and exhaustive paper list for Named Entity Recognition (NER)☆394Updated 2 years ago
- ☆228Updated 3 years ago
- Metaphor classification for verbs and content words☆65Updated last year
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆253Updated 4 months ago
- spaCy + UDPipe☆161Updated 2 years ago
- Scientific Document Summarization Corpus and Annotations from the WING NUS group.☆211Updated last year
- The official released annotations, both in .prop pointer format and as conll files. Does not contain the source texts☆137Updated 2 years ago
- A minimal, pure Python library to interface with CoNLL-U format files.☆148Updated last year
- A tool for holistic analysis of language generations systems☆467Updated 2 years ago
- A multilingual lexicon of words to hurt.☆82Updated 2 months ago
- BERT for Coreference Resolution☆446Updated 2 years ago
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy☆730Updated 5 months ago
- Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.co…☆310Updated 2 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆354Updated last year
- ☆83Updated 4 years ago
- Benchmarks for intrinsic word embeddings evaluation.☆60Updated 6 years ago
- A frame-semantic parsing system based on a softmax-margin SegRNN.☆229Updated 2 years ago
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.☆310Updated 2 weeks ago
- One million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.☆124Updated 5 years ago
- Large-scale multi-document summarization dataset and code☆279Updated last year
- Cross-lingual metaphor detection.☆66Updated 5 years ago
- AmbiverseNLU: A Natural Language Understanding suite by Max Planck Institute for Informatics☆209Updated last year
- Open Information Extraction (OpenIE) and Open Relation Extraction (ORE) papers and data.☆163Updated 4 years ago
- This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs☆173Updated last year
- Easier Automatic Sentence Simplification Evaluation☆160Updated last year
- PENMAN notation (e.g. AMR) in Python☆142Updated 4 months ago
- A toolkit for evaluating the linguistic knowledge and transferability of contextual representations. Code for "Linguistic Knowledge and T…☆210Updated 3 years ago