Belval / disklist
A python list implementation that uses the disk to handle very large collections
☆14Updated 6 years ago
Alternatives and similar repositories for disklist
Users who are interested in disklist are comparing it to the libraries listed below
- Notebooks and data associated to constructing and exploring a map of subreddits.☆55Updated 8 years ago
- Sentiment analysis made easy; built on top of solid libraries.☆24Updated 8 years ago
- Using ML to extract campaign finance data from messy forms for journalism☆77Updated 3 years ago
- ☆98Updated 5 years ago
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆142Updated last year
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆58Updated 4 years ago
- varied english texts for modern NLP testing☆77Updated 3 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆183Updated 2 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power☆191Updated 2 years ago
- Train word embeddings with Gensim and visualize them with TensorBoard☆34Updated 6 years ago
- Implementation of phonetic algorithm in python☆41Updated 7 years ago
- Linguistic Annotation and Visualization Tool for PDF Documents☆200Updated 6 years ago
- Python port for IWNLP.Lemmatizer☆18Updated 2 years ago
- Abydos NLP/IR library for Python☆194Updated 3 years ago
- Python scripts for creating stylistic word clouds☆87Updated 9 years ago
- Relatively simple text classification powered by spaCy☆41Updated 10 years ago
- A tool for visualizing trees, tailored specifically to the analysis of parse trees.☆83Updated 5 years ago
- Mechanical Turk on your own machine.☆207Updated last year
- ☆98Updated 4 years ago
- A knowledge base construction engine for richly formatted data☆412Updated 4 years ago
- Get list of common stop words in various languages in Python☆159Updated 3 months ago
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.☆250Updated 4 months ago
- 🍇 Edit and execute code snippets in the browser using Jupyter kernels☆224Updated 6 years ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆378Updated 3 years ago
- Python API and analysis of Chicago's bikeshare☆10Updated 3 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆171Updated 4 years ago
- Use ML-Annotate to label data for machine learning purposes☆110Updated 5 years ago
- ☆123Updated 2 years ago
- Extract synonyms, keywords from sentences using modified implementation of Aho Corasick algorithm☆40Updated 8 years ago
- Automatic labeling for topic model☆57Updated 10 years ago