schollz / string_matching
A simple and fast approach to selecting the best string in a list of strings despite errors or misspellings.
☆10 Updated 9 years ago
Alternatives and similar repositories for string_matching:
Users who are interested in string_matching are comparing it to the libraries listed below.
- Find which links on a web page are pagination links ☆29 Updated 8 years ago
- A disk-based key/value store in Python with no dependencies. ☆21 Updated 9 years ago
- Show summary of a large number of URLs in a Jupyter Notebook ☆17 Updated 3 years ago
- A high performance indexing and search system for managing big data ☆17 Updated 5 years ago
- Reduction is a Python script which automatically summarizes a text by extracting the sentences which are deemed to be most important. ☆55 Updated 9 years ago
- Match tokenized words and phrases within the original, untokenized, often messy, text. ☆20 Updated last year
- Python bindings to the Compact Language Detector ☆33 Updated 4 years ago
- Implementation of Bayesian Sets for fast similarity searches. ☆15 Updated 13 years ago
- Rapid NLP prototyping ☆72 Updated 2 years ago
- Generic Environment for Context-Aware Correction of Orthography ☆22 Updated 2 years ago
- Python wrapper for Apache OpenNLP tools ☆34 Updated 8 years ago
- Topic Model or LDA in Cython ☆21 Updated 13 years ago
- Python bindings for libwapiti ☆66 Updated 5 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived'] ☆82 Updated 8 years ago
- Stanford Tregex-inspired language for rule-based dependency tree manipulation. ☆21 Updated 7 years ago
- Semanticizest: dump parser and client ☆20 Updated 8 years ago
- Predicting closed questions on Stack Overflow ☆46 Updated 7 years ago
- Automated NLP sentiment predictions: batteries included, or use your own data ☆18 Updated 7 years ago
- A spell-checker extending Peter Norvig's with multi-typo correction, Hamming distance weighting, and more. ☆98 Updated 4 years ago
- Natural language generation language ☆55 Updated 5 years ago
- Extract the difference between two HTML pages ☆32 Updated 6 years ago
- A benchmark framework for testing algorithms and pairwise metrics. ☆67 Updated 11 years ago
- Tool to visualize data quickly with no brain usage for plot creation ☆46 Updated 5 years ago
- Workflow support for reproducible deduplication and merging ☆16 Updated last year
- A cell magic for futurize ☆10 Updated 9 years ago
- Concept discovery and recommendation library built on top of the IBM Watson cognitive API. ☆24 Updated 8 years ago
- MongoDB-backed Python dict-like interface ☆39 Updated 7 years ago
- Dynamic Topic Model (based upon code released by David Blei at http://www.cs.princeton.edu/~blei/topicmodeling.html) ☆31 Updated 6 years ago