kootenpv / tok
Fast and customizable tokenization
☆64 Updated 5 years ago
Alternatives and similar repositories for tok:
Users who are interested in tok are comparing it to the libraries listed below.
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning ☆41 Updated 4 years ago
- Find strings/words in text; convenience and C speed ☆126 Updated 2 years ago
- spaCy + UDPipe ☆160 Updated 2 years ago
- Lightning Fast Language Prediction 🚀 ☆165 Updated 5 years ago
- A python module for word inflections designed for use with spaCy. ☆92 Updated 5 years ago
- An easy to use open-source library for advanced Deep Learning and Natural Language Processing ☆112 Updated 6 months ago
- Server/Client around spaCy to load spaCy only once ☆46 Updated 7 years ago
- Textpipe: clean and extract metadata from text ☆302 Updated 3 years ago
- Enso: An Open Source Library for Benchmarking Embeddings + Transfer Learning Methods ☆95 Updated 4 years ago
- Example using Polyaxon to experiment with pre-training spaCy ☆65 Updated 3 years ago
- Misspelling Oblivious Word Embeddings ☆203 Updated 5 years ago
- Hunspell extension for spaCy 2.0. ☆94 Updated 6 months ago
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs." ☆26 Updated 2 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and … ☆51 Updated 2 months ago
- allennlp + streamlit demo ☆22 Updated 5 years ago
- A fast and memory-optimized string library for heavy-text manipulation in Python ☆250 Updated 4 years ago
- Polyglot skipgram embeddings, and their many health benefits ☆12 Updated 5 years ago
- 🤹‍♀️ Query spaCy's linguistic annotations using GraphQL ☆86 Updated 6 years ago
- A minimal, pure Python library to interface with CoNLL-U format files. ☆148 Updated last year
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power ☆190 Updated last year
- A Lightweight NLP Data Loader for All Deep Learning Frameworks in Python ☆181 Updated last year
- Use ML-Annotate to label data for machine learning purposes ☆107 Updated 4 years ago
- Language detection extension for spaCy 2.0+ ☆112 Updated 6 years ago
- A small tool that EXPLains spACY parse results. See what I did there? ☆83 Updated 2 years ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe… ☆67 Updated 2 years ago
- interactive explorer for language models ☆132 Updated 3 years ago
- Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddings ☆76 Updated 2 years ago
- A fully customisable language detection pipeline for spaCy ☆92 Updated 5 years ago
- Incremental learning of word embeddings with context informativeness. ☆94 Updated last year
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance. ☆75 Updated 3 years ago