schollz / string_matchingLinks
A simple and fast approach to selecting the best string in a list of strings despite errors or mispelling.
☆9Updated 10 years ago
Alternatives and similar repositories for string_matching
Users that are interested in string_matching are comparing it to the libraries listed below
Sorting:
- A disk-based key/value store in Python with no dependencies.☆21Updated 10 years ago
- Reduction is a python script which automatically summarizes a text by extracting the sentences which are deemed to be most important.☆54Updated 10 years ago
- An index data structure for approximate string search.☆23Updated 6 years ago
- Show summary of a large number of URLs in a Jupyter Notebook☆17Updated 4 years ago
- Topic Model or LDA in Cython☆21Updated 14 years ago
- Python wrapper for Apache OpenNLP tools☆34Updated 8 years ago
- Python bindings to the Compact Language Detector☆33Updated 5 years ago
- ☆24Updated 7 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆81Updated 9 years ago
- Python search module for fast approximate string matching☆54Updated 2 years ago
- Natural language generation language☆56Updated 6 years ago
- WordNet Domains, WordNet Affect and SentiWords☆48Updated 9 years ago
- C++ Ternary Search Tree implementation with Python bindings☆43Updated 7 years ago
- Python wrapper around SVDLIBC, a fast library for sparse Singular Value Decomposition☆55Updated 11 years ago
- Non-Overlapping Aho-Corasick Python extension, for Python 2 (str and unicode) and Python 3☆51Updated 10 years ago
- Python bindings for libwapiti☆67Updated 5 years ago
- Information Retrieval Library (in Python)☆83Updated 3 years ago
- Generic Environment for Context-Aware Correction of Orthography☆22Updated 2 years ago
- Experiment, Storage and Visualization Framework for Machine Learning research.☆31Updated 4 years ago
- Collects all tweets from the sample Public stream using Twitter's streaming API, and saves them to a file for later use as a corpus.☆45Updated 4 years ago
- Active Learning for text classification using scikit-learn☆24Updated 6 years ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Updated 8 years ago
- Repository for the CLiPS HAte speech DEtection System [HADES].☆24Updated 7 years ago
- Memory-efficient Count-Min Sketch Counter (based on Madoka C++ library)☆26Updated 6 years ago
- Dynamic Topic Model (based upon code released by David Blei at http://www.cs.princeton.edu/~blei/topicmodeling.html)☆31Updated 7 years ago
- A high performance indexing and search system for managing big data☆17Updated 6 years ago
- Thin wrapper for the Microsoft Cognitive Services☆60Updated 7 years ago
- Entity Linking for the masses☆56Updated 9 years ago
- A spell-checker extending Peter Norvig's with multi-typo correction, hamming distance weighting, and more.☆98Updated 4 years ago
- Exploring the shapes of stories using indico sentiment analysis APIs☆28Updated 9 years ago