idoshlomo / online_vectorizers
Extension of scikit-learn TfidfVectorizer and CountVectorizer that allows for online learning / partial fit.
☆33Updated 7 years ago
Alternatives and similar repositories for online_vectorizers:
Users that are interested in online_vectorizers are comparing it to the libraries listed below
- Find strings/words in text; convenience and C speed☆126Updated 2 years ago
- Storage and retrieval of Word Embeddings in various databases☆51Updated 6 years ago
- Bag of, not words, but tricks!☆68Updated last year
- The weights for the embedding layer of Scandinavian UMLFiT language models☆32Updated 5 years ago
- fastText Quick Start Guide, published by Packt☆49Updated 2 years ago
- Example using Polyaxon to experiment with pre-training spaCy☆65Updated 3 years ago
- Super Simple Similarities Service☆149Updated 3 weeks ago
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f…☆24Updated 4 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 9 months ago
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance.☆77Updated 3 years ago
- Relatively simple text classification powered by spaCy☆41Updated 9 years ago
- ULMFiT + Siamese Network for Sentence Vectors☆34Updated 6 years ago
- Sentence transformers models for SpaCy☆107Updated 2 years ago
- An easy to use open-source library for advanced Deep Learning and Natural Language Processing☆112Updated 9 months ago
- CLANA is a toolkit for classifier analysis.☆30Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- Polish BERT☆70Updated 4 years ago
- 🌊 Machine learning dataset loaders for testing and example scripts☆47Updated 3 years ago
- Fuzzy matching and more functionality for spaCy.☆256Updated 10 months ago
- Jupyter Widget for data annotation☆138Updated 2 years ago
- A fully customisable language detection pipeline for spaCy☆92Updated 6 years ago
- Implementation of GloVe in Keras☆45Updated 2 years ago
- Utilities for preprocessing text for deep learning with Keras☆180Updated 2 years ago
- 📝Natural language processing (NLP) utils: word embeddings (Word2Vec, GloVe, FastText, ...) and preprocessing transformers, compatible wi…☆63Updated last year
- Anonymization of legal cases (Fr) based on Flair embeddings☆88Updated 4 years ago
- ☄️ Parallel and distributed training with spaCy and Ray☆54Updated last year
- A lightweight command line interface for the management of arbitrary machine learning tasks☆19Updated 4 years ago
- Notebooks configured to be run with Binder, usually found on my blog.☆42Updated 2 years ago
- 📂 Additional lookup tables and data resources for spaCy☆105Updated 3 months ago
- ☆70Updated 2 years ago