MartinThoma / lidtk
Language Identification Toolkit
☆18 Updated 4 years ago
Alternatives and similar repositories for lidtk
Users who are interested in lidtk are comparing it to the libraries listed below.
- A multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection ☆60 Updated 8 years ago
- Keras implementation of ontology aware token embeddings ☆49 Updated 7 years ago
- Fast supervised sentence boundary detection using the averaged perceptron ☆91 Updated 6 years ago
- Utility scripts in Python ☆37 Updated 5 months ago
- 📄Neural Sentential Paraphrase Generation to Augment Chatbot Training Dataset ☆21 Updated 2 years ago
- Context Aware Language Models ☆28 Updated 7 years ago
- Getting started with AllenNLP and PyTorch by training a tweet classifier ☆66 Updated 8 years ago
- General-Purpose Neural Networks for Sentence Boundary Detection ☆73 Updated 2 years ago
- Normalizes lexically ill-formed text to its most likely clean text, e.g. "c u thr 2nite!" -> "see you there tonight!". ☆63 Updated 10 years ago
- Bidirectional Long-Short Term Memory tagger (bi-LSTM) (in DyNet) -- hierarchical (with word and character embeddings) ☆123 Updated 2 years ago
- ☆95 Updated 2 years ago
- A toolkit for generating paraphrase vector representations for words in context ☆23 Updated 10 years ago
- Semantic embeddings of entities ☆66 Updated 9 years ago
- Modularizing Unsupervised Sense Embedding ☆29 Updated 7 years ago
- C++ implementation of Generalised Brown clustering and python scripts for feature generation ☆41 Updated 9 years ago
- Language modeling scripts based on TensorFlow ☆58 Updated 6 years ago
- ☆21 Updated 8 years ago
- Obtaining word embeddings from a WordNet ontology ☆50 Updated last year
- Universal segmenter based on the Universal Dependency framework, written by Y. Shao, Uppsala University ☆34 Updated 6 years ago
- Semantic Textual Similarity in Python ☆80 Updated 8 years ago
- Semantic Entity Retrieval Toolkit ☆110 Updated 8 years ago
- Code base for representation learning of very short texts, such as tweets. By Cedric De Boom, IBCN, Ghent University, Belgium. ☆34 Updated 9 years ago
- Multi lingual character based named entity recognizer ☆24 Updated 7 years ago
- ✨ Web interface for NeuralCoref coreference resolution ☆34 Updated 2 years ago
- A curated question answering research dataset of factoid questions ☆49 Updated 6 years ago
- Decoding platform for machine translation research ☆54 Updated 6 years ago
- Non-distributional linguistic word vector representations. ☆62 Updated 8 years ago
- numeric fused-head identification and resolution ☆33 Updated 6 years ago
- Code for EMNLP 2018 paper "Auto-Encoding Dictionary Definitions into Consistent Word Embeddings" ☆36 Updated 7 years ago
- utility class for building/evaluating document representations ☆53 Updated 5 years ago