davidsbatista / lexiconsLinks
Dictionaries of names, surnames, acronyms and it's extensions, stop-words, etc., which I gathered for different experiments.
☆28Updated 8 years ago
Alternatives and similar repositories for lexicons
Users that are interested in lexicons are comparing it to the libraries listed below
Sorting:
- A simple neural truecaser written in pytorch and allennlp.☆33Updated last year
- BERT models for many languages created from Wikipedia texts☆33Updated 5 years ago
- Expletives vomiting library...☆13Updated 8 years ago
- Code and data used in named entity transliteration experiments☆57Updated 7 years ago
- ☆31Updated 8 years ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 4 years ago
- ☆32Updated 4 years ago
- c++ mosestokenizer☆18Updated last year
- A curated list of Natural Language Generation papers, tutorials, and blogs.☆12Updated 6 years ago
- Text processing library for sentiment analysis and related tasks☆27Updated 7 years ago
- Keras implementation of ontology aware token embeddings☆49Updated 6 years ago
- 📄Neural Sentential Paraphrase Generation to Augment Chatbot Training Dataset☆21Updated 2 years ago
- Language modeling scripts based on TensorFlow☆58Updated 6 years ago
- A collection of English tweets annotated in Universal Dependencies.☆39Updated 4 years ago
- ADS Project☆14Updated 9 years ago
- Code and data related to "Efficient, Compositional, Order-Sensitive n-gram Embeddings" (EACL 2017)☆14Updated 8 years ago
- General-Purpose Neural Networks for Sentence Boundary Detection☆73Updated 2 years ago
- Hierarchical word clustering, following "Brown clustering" (Brown et al., 1992)☆70Updated 10 years ago
- The WebSplit Benchmark introducing "Split and Rephrase" task☆63Updated 7 years ago
- Pre-trained models and code and data to train and use models from "Pushing the Limits of Paraphrastic Sentence Embeddings with Millions o…☆103Updated last year
- Brown clustering in Python☆22Updated 7 years ago
- Embeddings for n-grams☆11Updated 7 years ago
- Implementation of a simple frame identification approach (SimpleFrameId) described in the paper "Out-of-domain FrameNet Semantic Role Lab…☆15Updated 8 years ago
- Code and data for segmentation experiments.☆20Updated 10 years ago
- Generating Questions and Distractors automatically from Multimedia. Undergraduate Thesis work.☆22Updated 9 years ago
- ✨ Web interface for NeuralCoref coreference resolution☆34Updated 2 years ago
- ☆34Updated 5 years ago
- Open-source tools for morphological tagging, segmentation and stemming.☆40Updated 6 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Updated 2 years ago
- Neural Network for Automatic Negation Detection☆20Updated 9 years ago