nreimers / truecaser
Language independent truecaser in Python.
☆160Updated 3 years ago
Alternatives and similar repositories for truecaser:
Users that are interested in truecaser are comparing it to the libraries listed below
- A python true casing utility that restores case information for texts☆88Updated 2 years ago
- Automatic extraction of edited sentences from text edition histories.☆82Updated 3 years ago
- Guidelines.☆96Updated 7 months ago
- A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities☆114Updated 2 years ago
- This is a CoNLL formatted version of the OntoNotes 5.0 release.☆189Updated 10 years ago
- spaCy + UDPipe☆161Updated 2 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 7 months ago
- State-of-the-art Supervised Sentence Simplification System from ACL 2014☆46Updated 6 years ago
- Large corpus of uncompressed and compressed sentences from news articles.☆123Updated 7 years ago
- Text Simplification System and Dataset☆124Updated last year
- Named Entity Recognition based on dictionaries☆242Updated 6 years ago
- Neural network models for joint POS tagging and dependency parsing (CoNLL 2017-2018)☆158Updated 5 years ago
- Text tokenization and sentence segmentation (segtok v2)☆202Updated 3 years ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆254Updated 6 months ago
- A python module for word inflections designed for use with spaCy.☆92Updated 5 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆169Updated 3 years ago
- Appraise evaluation system for manual evaluation of machine translation output☆74Updated 3 years ago
- Named Entity Recognition data for Europeana Newspapers☆171Updated last year
- Easier Automatic Sentence Simplification Evaluation☆160Updated last year
- One million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.☆123Updated 5 years ago
- Implementation of Hobbs' algorithm for coreference resolution in python☆44Updated 4 years ago
- Tools for downloading and analyzing summaries and evaluating summarization systems. https://summari.es/☆147Updated last year
- DRESS simplification model (EMNLP 2017) described in http://aclweb.org/anthology/D/D17/D17-1062.pdf☆155Updated 3 years ago
- A minimal, pure Python library to interface with CoNLL-U format files.☆149Updated last year
- Python wrapper for wit.ai's Duckling Clojure library☆131Updated 3 years ago
- PredPatt: Predicate-Argument Extraction from Universal Dependencies☆111Updated 4 years ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆72Updated 10 years ago
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.☆314Updated last month
- Concatenated Power Mean Embeddings as Universal Cross-Lingual Sentence Representations☆185Updated 4 years ago
- The official released annotations, both in .prop pointer format and as conll files. Does not contain the source texts☆137Updated 2 years ago