kootenpv / tok
Fast and customizable tokenization
☆64 Updated 5 years ago
Alternatives and similar repositories for tok:
Users who are interested in tok are comparing it to the libraries listed below.
- Find strings/words in text; convenience and C speed ☆126 Updated 2 years ago
- An easy to use open-source library for advanced Deep Learning and Natural Language Processing ☆112 Updated 9 months ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning ☆41 Updated 4 years ago
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs." ☆26 Updated 2 years ago
- A python module for word inflections designed for use with spaCy. ☆92 Updated 5 years ago
- Parse natural language time expressions in python ☆130 Updated 2 years ago
- allennlp + streamlit demo ☆22 Updated 5 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and … ☆51 Updated 5 months ago
- Enso: An Open Source Library for Benchmarking Embeddings + Transfer Learning Methods ☆95 Updated 4 years ago
- Natural Language Data Augmentation Tool for Conversational Systems ☆115 Updated 2 years ago
- Example using Polyaxon to experiment with pre-training spaCy ☆65 Updated 3 years ago
- Hunspell extension for spaCy 2.0. ☆94 Updated 9 months ago
- Server/Client around Spacy to load spacy only once ☆46 Updated 7 years ago
- jiant-dev ☆28 Updated 4 years ago
- Super lightweight function registries for your library ☆179 Updated 11 months ago
- Lightning Fast Language Prediction 🚀 ☆166 Updated 6 years ago
- numeric fused-head identification and resolution ☆33 Updated 5 years ago
- Textpipe: clean and extract metadata from text ☆301 Updated 3 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power ☆190 Updated 2 years ago
- Polyglot skipgram embeddings, and their many health benefits ☆12 Updated 5 years ago
- interactive explorer for language models ☆133 Updated 3 years ago
- ☆30 Updated 2 years ago
- SemEval 2019 Hyperpartisan News Detection - team Bertha von Suttner contribution ☆22 Updated 5 years ago
- ☆70 Updated 2 years ago
- A Lightweight NLP Data Loader for All Deep Learning Frameworks in Python ☆181 Updated last year
- Learning BPE embeddings by first learning a segmentation model and then training word2vec ☆19 Updated 2 years ago
- Incremental learning of word embeddings with context informativeness. ☆94 Updated last year
- spaCy pipeline component for adding text readability meta data to Doc objects. ☆56 Updated 6 years ago
- Jupyter Widget for data annotation ☆138 Updated 2 years ago
- Jupyter extension to visualize dependency structures ☆28 Updated 7 years ago