steveash / NETransliteration-COLING2018
Code and data used in named entity transliteration experiments
☆57 Updated 6 years ago
Alternatives and similar repositories for NETransliteration-COLING2018:
Users who are interested in NETransliteration-COLING2018 are comparing it to the libraries listed below:
- Doing things with embeddings ☆64 Updated 2 years ago
- A simple neural truecaser written in PyTorch and AllenNLP. ☆33 Updated 8 months ago
- Fast supervised sentence boundary detection using the averaged perceptron ☆90 Updated 6 years ago
- Automatic extraction of edited sentences from text edition histories. ☆82 Updated 3 years ago
- OpenNeuroSpell contains parts of NeuroSpell (http://neurospell.com/en.php) released as open-source. More code will be published as soon a… ☆20 Updated 4 months ago
- ☆12 Updated 9 years ago
- Corpus preprocessing ☆95 Updated 11 months ago
- ☆21 Updated 5 years ago
- CoNLL 2018 Shared Task Team UDPipe-Future ☆39 Updated 4 years ago
- A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities ☆113 Updated 2 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation ☆191 Updated 4 years ago
- BERT models for many languages created from Wikipedia texts ☆33 Updated 4 years ago
- Demonstration of the results in "Text Normalization using Memory Augmented Neural Networks", Authors: Subhojeet Pramanik, Aman Hussain ☆60 Updated 5 years ago
- numeric fused-head identification and resolution ☆33 Updated 5 years ago
- Fast Word Clustering Software ☆78 Updated 3 weeks ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia ☆72 Updated 10 years ago
- Open-source tools for morphological tagging, segmentation and stemming. ☆41 Updated 5 years ago
- A transition-based parser for Universal Dependencies with BiLSTM word and character representations. ☆80 Updated 2 years ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline. ☆86 Updated 3 years ago
- Efficient Markov Chain word alignment ☆53 Updated 3 years ago
- ☆45 Updated 7 months ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021) ☆48 Updated 3 years ago
- Tool to fix bitexts and tag near-duplicates for removal ☆29 Updated 3 weeks ago
- LSTM Language Model with Subword Units Input Representations ☆42 Updated 3 years ago
- UniParse: A universal graph-based parsing toolkit ☆10 Updated 5 years ago
- Workshop on Noisy User-generated Text (W-NUT) ☆30 Updated last week
- SMOR (Stuttgart Morphology) with alternative lemmatization component ☆12 Updated last year
- ☆25 Updated 4 years ago
- Efficient Low-Memory Aligner ☆142 Updated last month
- ☆42 Updated 6 years ago