LanguageMachines / CLIN28_ST_spelling_correction
Scripts that were used for preparing and converting the Wikipedia documents that are part of the CLIN28 shared task on spelling correction
☆10Updated 7 years ago
Alternatives and similar repositories for CLIN28_ST_spelling_correction:
Users that are interested in CLIN28_ST_spelling_correction are comparing it to the libraries listed below
- CoNLL 2018 Shared Task Team UDPipe-Future☆39Updated 4 years ago
- numeric fused-head identification and resolution☆33Updated 5 years ago
- A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities☆112Updated 2 years ago
- C++ implementation of Generalised Brown clustering and python scripts for feature generation☆41Updated 8 years ago
- Python library for vector space models☆13Updated 6 years ago
- A compound splitter based on the semantic regularities in the vector space of word embeddings.☆16Updated 7 years ago
- Decoding platform for machine translation research☆54Updated 5 years ago
- Doing things with embeddings☆64Updated 2 years ago
- Jupyter extension to visualize dependency structures☆28Updated 6 years ago
- KenLM extension for spaCy 2.0.☆16Updated 7 years ago
- A python wrapper for Semaphore, a Shallow Semantic Parser that identifies roles in a text.☆12Updated 11 years ago
- Appraise evaluation system for manual evaluation of machine translation output☆74Updated 3 years ago
- A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, …☆34Updated 5 years ago
- ☆22Updated 7 years ago
- In this project, we use skip-gram model to embed Wikipedia Concepts and Entities. The English version of Wikipedia contains more than fiv…☆56Updated 7 years ago
- Keras implementation of ontology aware token embeddings☆48Updated 6 years ago
- Incremental learning of word embeddings with context informativeness.☆95Updated last year
- Watset: Automatic Induction of Synsets from a Graph of Synonyms☆16Updated 5 years ago
- The Attract-Repel algorithm presented in (Mrkšić et al., TACL 2017), with accompanying resources.☆63Updated 7 years ago
- Automatic extraction of edited sentences from text edition histories.☆82Updated 2 years ago
- ☆23Updated 7 years ago
- Code for EMNLP 2016 paper: Morphological Priors for Probabilistic Word Embeddings☆52Updated 8 years ago
- Labeled examples from wiki dumps in Python☆67Updated 8 years ago
- Sume is an implementation of the concept-based ILP model for summarization.☆38Updated 6 years ago
- German Morphological Analyzer☆47Updated 3 years ago
- scripts and data for ACL 16 paper☆14Updated 8 years ago
- ADEL is a robust and efficient entity linking framework that is adaptive to text genres and language, entity types for the classification…☆17Updated 5 years ago
- Alignment and annotation for comparable documents.☆22Updated 6 years ago
- Inter-annotator agreement for Doccano☆27Updated 4 years ago
- Bidirectional Long-Short Term Memory tagger (bi-LSTM) (in DyNet) -- hierarchical (with word and character embeddings)☆122Updated last year