ThoughtRiver / lmdb-embeddings
Fast word vectors with little memory usage in Python
☆419 Updated 3 years ago
Alternatives and similar repositories for lmdb-embeddings:
Users who are interested in lmdb-embeddings are comparing it to the libraries listed below.
- All-pair set similarity search on millions of sets in Python and on a laptop ☆591 Updated 2 years ago
- Scikit-learn style model finetuning for NLP ☆707 Updated this week
- Intuitive Annotation Tool for Information Extraction / Named Entity Recognition using localturk / Amazon Mechanical Turk ☆265 Updated 5 years ago
- Quantized word vectors that take 8x-16x less space than regular word vectors ☆755 Updated 4 years ago
- A context-preserving word cloud generator ☆441 Updated last year
- Feature engineering and machine learning: together at last! ☆24 Updated 4 years ago
- scikit-learn wrappers for Python fastText. ☆233 Updated 2 years ago
- Calculates Word Mover's Distance Insanely Fast ☆461 Updated last year
- A fast, efficient universal vector embedding utility package. ☆1,637 Updated last year
- Yet another Python binding for fastText ☆227 Updated 6 years ago
- Various Algorithms for Short Text Mining ☆466 Updated last week
- Organized Resources for Deep Learning in Natural Language Processing ☆434 Updated 4 years ago
- 🚀 100 Times Faster Natural Language Processing in Python - iPython notebook ☆335 Updated 6 years ago
- Text Classification Library in Keras ☆420 Updated 6 years ago
- 🌊 HMTL: Hierarchical Multi-Task Learning - A State-of-the-Art neural network model for several NLP tasks based on PyTorch and AllenNLP ☆1,192 Updated last year
- Efficient Counter that uses a limited (bounded) amount of memory regardless of data size. ☆934 Updated 2 years ago
- Dynamic Meta-Embeddings for Improved Sentence Representations ☆332 Updated 4 years ago
- Lazydata: Scalable data dependencies for Python projects ☆625 Updated 5 years ago
- Code for the paper "DeepType: Multilingual Entity Linking by Neural Type System Evolution" ☆649 Updated last year
- Textpipe: clean and extract metadata from text ☆301 Updated 3 years ago
- 💫 Scripts, tools and resources for developing spaCy ☆125 Updated 5 years ago
- tensorflow port of the lda2vec model for unsupervised learning of document + topic + word embeddings ☆437 Updated 7 years ago
- A high-level, rapid development framework for machine learning projects ☆345 Updated last year
- Fast, DB Backed pretrained word embeddings for natural language processing. ☆223 Updated last year
- 🔡 Token level embeddings from BERT model on mxnet and gluonnlp ☆452 Updated 5 years ago
- code + contents of my website, and programming life ☆362 Updated last week
- NLP library designed for reproducible experimentation management ☆293 Updated 5 months ago
- 💫 REST microservices for various spaCy-related tasks ☆240 Updated 2 years ago
- ADAM - A Question Answering System. Inspired from IBM Watson ☆355 Updated 4 years ago
- a Deep Learning Framework for Text https://delft.readthedocs.io/ ☆389 Updated last week