davidsbatista / lexicons
Dictionaries of names, surnames, acronyms and it's extensions, stop-words, etc., which I gathered for different experiments.
β28Updated 7 years ago
Alternatives and similar repositories for lexicons:
Users that are interested in lexicons are comparing it to the libraries listed below
- A simple neural truecaser written in pytorch and allennlp.β33Updated 8 months ago
- πNeural Sentential Paraphrase Generation to Augment Chatbot Training Datasetβ21Updated 2 years ago
- List of corpora annotated for coreference for different languagesβ17Updated 6 months ago
- A curated list of Natural Language Generation papers, tutorials, and blogs.β12Updated 6 years ago
- BERT models for many languages created from Wikipedia textsβ33Updated 4 years ago
- An example of how to use spaCy for extremely large files without running into memory issuesβ36Updated 2 years ago
- This repository contains the code for the Form-Context Model and its Attentive Mimicking variant.β31Updated 4 years ago
- EigenSent: Spectral sentence embeddings using higher-order Dynamic Mode Decompositionβ12Updated 5 years ago
- Deep-learning based sentence auto-segmentation from unstructured text w/o punctuationβ36Updated 7 years ago
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.β26Updated 3 years ago
- Scripts and tools for doing unsupervised acceptability prediction.β15Updated last year
- ADS Projectβ14Updated 9 years ago
- Training a model without a dataset for natural language inference (NLI)β25Updated 4 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)β48Updated 3 years ago
- Repository for Findings of EMNLP 2020 "Context-aware Stand-alone Neural Spelling Correction"β18Updated 4 years ago
- Text processing library for sentiment analysis and related tasksβ27Updated 6 years ago
- β17Updated last year
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languagesβ13Updated 2 years ago
- β12Updated last year
- An implementation of GrASP (Shnarch et. al., 2017)β21Updated 2 years ago
- β33Updated 3 years ago
- Practical ML and NLP with examples.β34Updated last year
- Minimal code to train ELMo models in recent versions of TensorFlowβ14Updated last year
- Neural Network for Automatic Negation Detectionβ20Updated 8 years ago
- A web interface to understand language-specific BERT-modelsβ17Updated 10 months ago
- Dynamic Entity Summarization (DynES)β20Updated 5 years ago
- Unofficial implementation of Adaptive Input in PyTorchβ12Updated 6 years ago
- Build a dialog dataset from online books in many languagesβ72Updated 2 years ago
- β22Updated 7 months ago
- LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository cβ¦β14Updated 4 years ago