Belval / disklist
A python list implementation that uses the disk to handle very large collections
☆14Updated 5 years ago
Alternatives and similar repositories for disklist:
Users that are interested in disklist are comparing it to the libraries listed below
- Tools to manipulate and extract data from wikipedia dumps☆45Updated 11 years ago
- Python CLI to apply word2vec to all sorts of text documents.☆48Updated 7 years ago
- Notebooks and data associated to constructing and exploring a map of subreddits.☆55Updated 7 years ago
- Python wrapper around SVDLIBC, a fast library for sparse Singular Value Decomposition☆55Updated 11 years ago
- Burglary prediction for mortals☆10Updated 9 months ago
- Sentiment analysis made easy; built on top off solid libraries.☆24Updated 7 years ago
- Algorithms for "schema matching"☆26Updated 8 years ago
- Python search module for fast approximate string matching☆54Updated 2 years ago
- Course on Language Technologies and NLP☆15Updated 7 years ago
- Polyglot skipgram embeddings, and their many health benefits☆12Updated 5 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning☆41Updated 4 years ago
- allennlp + streamlit demo☆22Updated 5 years ago
- Extract synonyms, keywords from sentences using modified implementation of Aho Corasick algorithm☆40Updated 7 years ago
- A pure Python implementation of Aho-Corasick algorithm.☆22Updated 6 years ago
- Relatively simple text classification powered by spaCy☆41Updated 9 years ago
- Jupyter notebook-post of advanced numpy techniques☆58Updated 6 years ago
- A clean and easy interface for performing nearest-neighbor lookups☆50Updated 5 years ago
- spaCy pipeline component for adding text readability meta data to Doc objects.☆56Updated 5 years ago
- Python 3 implementation and documentation of the Hermina-Janos local graph clustering algorithm.☆21Updated 2 years ago
- Titus 2 : Portable Format for Analytics (PFA) implementation for Python 3.4+☆23Updated 2 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013)☆55Updated 3 years ago
- Example using Polyaxon to experiment with pre-training spaCy☆65Updated 3 years ago
- Text readability metrics in Python.☆11Updated 11 years ago
- Regex like pattern tree matching but on sentence's tree instead of Strings☆42Updated 6 years ago
- Accuracy-based Learning Classifier Systems (XCS)☆49Updated 10 months ago
- Tools and services for evaluating topic models☆15Updated 8 years ago
- Python wrapper for a C++ Double Metaphone☆15Updated 2 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby☆17Updated 2 years ago
- Topic Model or LDA in Cython☆21Updated 13 years ago