kootenpv / tok
Fast and customizable tokenization
☆64 · Updated 5 years ago
Alternatives and similar repositories for tok:
Users who are interested in tok are comparing it to the libraries listed below.
- Find strings/words in text; convenience and C speed ☆126 · Updated 2 years ago
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs." ☆26 · Updated 2 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning ☆42 · Updated 4 years ago
- An easy to use open-source library for advanced Deep Learning and Natural Language Processing ☆112 · Updated 5 months ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and … ☆50 · Updated last month
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe… ☆64 · Updated 2 years ago
- Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddings ☆76 · Updated 2 years ago
- Example using Polyaxon to experiment with pre-training spaCy ☆65 · Updated 3 years ago
- A Lightweight NLP Data Loader for All Deep Learning Frameworks in Python ☆180 · Updated last year
- A fast and memory-optimized string library for heavy-text manipulation in Python ☆250 · Updated 4 years ago
- allennlp + streamlit demo ☆22 · Updated 5 years ago
- Enso: An Open Source Library for Benchmarking Embeddings + Transfer Learning Methods ☆96 · Updated 4 years ago
- ☆122 · Updated last year
- A Python module for word inflections designed for use with spaCy. ☆92 · Updated 4 years ago
- ☆9 · Updated 4 years ago
- spaCy + UDPipe ☆161 · Updated 2 years ago
- Lightning Fast Language Prediction 🚀 ☆165 · Updated 5 years ago
- ☆30 · Updated 2 years ago
- Tokenize and clean strings in Python ☆13 · Updated 7 years ago
- Set up the CTRL text-generating model on Google Compute Engine with just a few console commands. ☆151 · Updated 5 years ago
- Python stream processing for humans ☆184 · Updated 2 months ago
- Misspelling Oblivious Word Embeddings ☆203 · Updated 5 years ago
- Textpipe: clean and extract metadata from text ☆301 · Updated 3 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power ☆192 · Updated last year
- 🤹♀️ Query spaCy's linguistic annotations using GraphQL ☆86 · Updated 6 years ago
- Server/client around spaCy to load spaCy only once ☆46 · Updated 7 years ago
- numeric fused-head identification and resolution ☆33 · Updated 5 years ago
- Easy-to-use text representations extraction library based on the Transformers library. ☆32 · Updated 2 years ago
- Official details for: [1803.08493] Context is Everything: Finding Meaning Statistically in Semantic Spaces ☆39 · Updated 5 years ago