chuanconggao / TopSim
Efficiently search for the strings most similar to a query in Python.
☆18 Updated 6 years ago
Related projects
Alternatives and complementary repositories for TopSim
- Python search module for fast approximate string matching ☆53 Updated last year
- NYAN is a news filtering engine written in Python and some Ruby. ☆15 Updated last year
- Find which links on a web page are pagination links ☆29 Updated 7 years ago
- The slides, code examples and resources for the PyCon 2015 Ireland talk on building data pipelines ☆13 Updated 9 years ago
- Implementations of some outlier detection algorithms ☆11 Updated 9 years ago
- A tiny Python utility that converts data crawled from different services into a cloud of words ☆30 Updated 6 years ago
- Scrapy extension which writes crawled items to Kafka ☆30 Updated 6 years ago
- Practical Natural Language Processing Tools for Humans is built on top of Senna Natural Language Processing (NLP) predictions: part-… ☆22 Updated 3 years ago
- Tag documents using top-N words with LDA ☆10 Updated 9 years ago
- Machine Learning Versioning made Simple ☆38 Updated 2 years ago
- Some convenient natural language tools that build on NLTK. ☆85 Updated 10 years ago
- A disk-based key/value store in Python with no dependencies. ☆21 Updated 9 years ago
- Yet another regression toolkit ☆12 Updated 11 years ago
- Simple Python interface to SMAC. ☆10 Updated 10 years ago
- A pure Python implementation of the Aho-Corasick algorithm. ☆23 Updated 6 years ago
- Small set of utilities to simplify writing Scrapy spiders. ☆49 Updated 9 years ago
- The top 10 solution to the "Growing Instability: Classifying Crisis Reports" challenge ☆21 Updated 7 years ago
- ☆13 Updated 4 years ago
- A PDFMiner wrapper to ease the text extraction from PDF files. ☆25 Updated 11 years ago
- Tools and services for evaluating topic models ☆15 Updated 8 years ago
- A memory-based, optional-persistence naïve Bayesian text classifier. ☆35 Updated 9 years ago
- Aho-Corasick string replacement utility ☆23 Updated 4 years ago
- A tool that evolves small brains capable of scanning and classifying an image. ☆13 Updated 8 years ago
- ☆9 Updated 4 years ago
- Online machine learning algorithms (based on OLL C++ library) ☆22 Updated 7 years ago
- Simple and clean Python implementation of TextRank as per the seminal paper by Rada Mihalcea and Paul Tarau. This implementation performs bot… ☆11 Updated 3 years ago
- ☆24 Updated 6 years ago