chuanconggao / TopSim
Efficiently search the most similar strings against the query in Python.
☆18Updated 6 years ago
Related projects: ⓘ
- ☆33Updated this week
- Python search module for fast approximate string matching☆53Updated last year
- Practical Natural Language Processing Tools for Humans is build on the top of Senna Natural Language Processing (NLP) predictions: part-…☆22Updated 3 years ago
- implement some outlier detection algorithms☆11Updated 8 years ago
- ☆30Updated this week
- A pure Python implementation of Aho-Corasick algorithm.☆23Updated 6 years ago
- templatemaker is a Python library that can extract data from files with a similar format, like HTML pages.☆63Updated 3 years ago
- The slides, code examples and resources for the PyCon 2015 Ireland talk on building data pipelines☆13Updated 8 years ago
- Find which links on a web page are pagination links☆29Updated 7 years ago
- Tools and services for evaluating topic models☆15Updated 8 years ago
- simple python interface to SMAC.☆10Updated 10 years ago
- ☆24Updated this week
- Scrapy extension which writes crawled items to Kafka☆30Updated 5 years ago
- An Apache Lucene TokenFilter that uses a word2vec vectors for term expansion.☆24Updated 10 years ago
- A memory-based, optional-persistence naïve bayesian text classifier.☆35Updated 9 years ago
- allennlp + streamlit demo☆21Updated 4 years ago
- Easy to follow text classifying implementation using a Conv. Neural Network (Tensorflow)☆14Updated 7 years ago
- A tiny python utility that converts data crawled from different services into a cloud of words☆30Updated 6 years ago
- Experiment, Storage and Visualization Framework for Machine Learning research.☆31Updated 3 years ago
- ☆51Updated this week
- A PDFMiner wrapper to ease the text extraction from pdf files.☆25Updated 11 years ago
- A pure-python implementation of BK-Trees☆15Updated last year
- Some convenient natural language tools that build on NLTK.☆85Updated 10 years ago
- A tool that evolves small brains capable of scanning and classifying an image.☆12Updated 8 years ago
- Aho-Corasick string replacement utility☆23Updated 4 years ago
- Pipeline for distributed Natural Language Processing, made in Python☆64Updated 7 years ago
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow☆13Updated 8 years ago
- Source code for exploring MLlib blog post☆11Updated 9 years ago
- Model explanation provides the ability to interpret the effect of the predictors on the composition of an individual score.☆12Updated 3 years ago
- CHeSF is the Chrome Headless Scraping Framework, a very very alpha code to scrape javascript intensive web pages☆20Updated 6 years ago