clarinsi / csmtiserLinks
A tool for text normalisation via character-level machine translation
☆13Updated 5 years ago
Alternatives and similar repositories for csmtiser
Users that are interested in csmtiser are comparing it to the libraries listed below
Sorting:
- Alignment and annotation for comparable documents.☆22Updated 6 years ago
- Open-source tools for morphological tagging, segmentation and stemming.☆40Updated 6 years ago
- This dataset contains naturally-occurring English sentences that feature non-trivial noun-verb ambiguity.☆35Updated 6 years ago
- Repository for rstWeb, a browser based annotation interface for Rhetorical Structure Theory☆45Updated last month
- CoNLL 2018 Shared Task Team UDPipe-Future☆39Updated 4 years ago
- Format conversion and graphical representation of [Universal Dependencies](http://universaldependencies.org) trees.☆12Updated last year
- Transition-based UCCA Parser☆73Updated 4 years ago
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages☆14Updated 5 years ago
- A transition-based parser for Universal Dependencies with BiLSTM word and character representations.☆82Updated 3 years ago
- A collection of English tweets annotated in Universal Dependencies.☆39Updated 3 years ago
- ☆47Updated 8 years ago
- The Universal Decompositional Semantics (UDS) dataset and the Decomp toolkit☆58Updated last month
- Appraise evaluation system for manual evaluation of machine translation output☆77Updated 4 years ago
- PredPatt: Predicate-Argument Extraction from Universal Dependencies☆111Updated 4 years ago
- Neural Semantic Graph Parser☆29Updated 7 years ago
- ☆33Updated 4 years ago
- GC4LM: A Colossal (Biased) language model for German☆13Updated 4 years ago
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them☆38Updated 3 years ago
- ☆23Updated 8 years ago
- Fast Word Clustering Software☆78Updated 7 months ago
- Various utilities for processing the data.☆211Updated this week
- A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities☆118Updated 2 months ago
- Repository for the Georgetown University Multilayer Corpus (GUM)☆99Updated last month
- ☆48Updated 6 years ago
- Workshop on Noisy User-generated Text (W-NUT)☆30Updated 4 months ago
- The Benchmark of Linguistic Minimal Pairs☆153Updated 2 years ago
- Keras implementation of ontology aware token embeddings☆49Updated 6 years ago
- Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Depenencies☆70Updated 6 years ago
- Jupyter extension to visualize dependency structures☆28Updated 7 years ago
- Baselines and corpus accompanying paper Neural Network Acceptability Judgments☆56Updated 5 years ago