kootenpv / tokLinks
Fast and customizable tokenization
☆64Updated 5 years ago
Alternatives and similar repositories for tok
Users that are interested in tok are comparing it to the libraries listed below
Sorting:
- Find strings/words in text; convenience and C speed☆126Updated 2 years ago
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs."☆26Updated 2 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated 5 months ago
- spaCy + UDPipe☆161Updated 3 years ago
- Enso: An Open Source Library for Benchmarking Embeddings + Transfer Learning Methods☆95Updated 4 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning☆41Updated 5 years ago
- A Lightweight NLP Data Loader for All Deep Learning Frameworks in Python☆181Updated last year
- Lightning Fast Language Prediction 🚀☆167Updated 6 years ago
- Example using Polyaxon to experiment with pre-training spaCy☆65Updated 3 years ago
- Server/Client around Spacy to load spacy only once☆46Updated 7 years ago
- A fully customisable language detection pipeline for spaCy☆92Updated 6 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 10 months ago
- Polyglot skipgram embeddings, and their many health benefits☆12Updated 5 years ago
- Natural Language Data Augmentation Tool for Conversational Systems☆115Updated 2 years ago
- An easy to use open-source library for advanced Deep Learning and Natural Language Processing☆112Updated 10 months ago
- A spell checker built from GloVe word vectors☆81Updated 7 years ago
- interactive explorer for language models☆133Updated 3 years ago
- numeric fused-head identification and resolution☆33Updated 5 years ago
- Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddings☆77Updated 2 years ago
- Official details for: [1803.08493] Context is Everything: Finding Meaning Statistically in Semantic Spaces☆39Updated 5 years ago
- allennlp + streamlit demo☆22Updated 5 years ago
- Jupyter Widget for data annotation☆139Updated 2 years ago
- ULMFiT + Siamese Network for Sentence Vectors☆33Updated 6 years ago
- Jupyter extension to visualize dependency structures☆28Updated 7 years ago
- ☆123Updated 2 years ago
- Easy-to-use text representations extraction library based on the Transformers library.☆32Updated 2 years ago
- Learning BPE embeddings by first learning a segmentation model and then training word2vec☆19Updated 2 years ago
- Natural language generation language☆56Updated 6 years ago
- LASER multilingual sentence embeddings as a pip package☆223Updated last year
- sumgram is a tool that summarizes a collection of text documents by generating the most frequent sumgrams (conjoined ngrams)☆55Updated 10 months ago