chuanconggao / TopSim
Efficiently search the most similar strings against the query in Python.
☆18Updated this week
Alternatives and similar repositories for TopSim:
Users that are interested in TopSim are comparing it to the libraries listed below
- Python search module for fast approximate string matching☆54Updated 2 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- The top 10 solution to the "Growing Instability: Classifying Crisis Reports" challenge☆21Updated 7 years ago
- A pure Python implementation of Aho-Corasick algorithm.☆22Updated 6 years ago
- A disk-based key/value store in Python with no dependencies.☆21Updated 9 years ago
- Tools and services for evaluating topic models☆15Updated 8 years ago
- Practical Natural Language Processing Tools for Humans is build on the top of Senna Natural Language Processing (NLP) predictions: part-…☆22Updated 3 years ago
- NYAN is a news filtering engine written in Python and some Ruby.☆15Updated last year
- Scrapy extension which writes crawled items to Kafka☆30Updated 6 years ago
- Neural Elastic Inference and Search☆19Updated 5 years ago
- Some convenient natural language tools that build on NLTK.☆85Updated 10 years ago
- Textprep is an analyzing tool for both parallel and non-parallel corpus and its down-stream Natural Language Processing and Machine Trans…☆32Updated 6 years ago
- implement some outlier detection algorithms☆11Updated 9 years ago
- Aho-Corasick string replacement utility☆24Updated 5 years ago
- tag doc using topN words with lda☆10Updated 9 years ago
- Distributed text analysis suite based on Celery☆95Updated 2 years ago
- Model explanation provides the ability to interpret the effect of the predictors on the composition of an individual score.☆13Updated 4 years ago
- Using Centroids of Word Embeddings and Word Mover's Distance for Biomedical Document Retrieval in Question Answering.☆14Updated 7 years ago
- An easy-to-use Python wrapper for the Don Best Sports Data API.☆16Updated 2 years ago
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow☆13Updated 8 years ago
- The slides, code examples and resources for the PyCon 2015 Ireland talk on building data pipelines☆13Updated 9 years ago
- Utility to help search within a set of jupyter notebooks☆16Updated 5 years ago
- A tool that evolves small brains capable of scanning and classifying an image.☆13Updated 8 years ago
- Attentional Neural Network that translates text to phones.☆11Updated 7 years ago
- A word hashing method based on vectors of letter n-grams. Currently transforms text into sequences of numbers.☆10Updated 7 years ago
- allennlp + streamlit demo☆22Updated 5 years ago
- Small set of utilities to simplify writing Scrapy spiders.☆49Updated 9 years ago
- Object detection and text spotting from images of any size. Based on TensorFlow.☆10Updated 8 years ago
- A tiny python utility that converts data crawled from different services into a cloud of words☆30Updated 6 years ago
- unofficial git mirror of http://svn.whoosh.ca svn repo☆49Updated 15 years ago