schollz / string_matching
A simple and fast approach to selecting the best string in a list of strings despite errors or mispelling.
☆9Updated 10 years ago
Alternatives and similar repositories for string_matching:
Users that are interested in string_matching are comparing it to the libraries listed below
- A benchmark framework for testing algorithms and pairwise metrics.☆67Updated 12 years ago
- Complete Mechanical Turk API written in Python that uses the same names as the official documentation☆46Updated 8 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- Python utilities for detecting textual reuse☆21Updated 9 years ago
- Exploring the shapes of stories using indico sentiment analysis APIs☆28Updated 9 years ago
- Reduction is a python script which automatically summarizes a text by extracting the sentences which are deemed to be most important.☆55Updated 10 years ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Updated 8 years ago
- Python search module for fast approximate string matching☆54Updated 2 years ago
- ☆24Updated 7 years ago
- Simple CORPORA list crawler☆10Updated 8 years ago
- This little program generates a thumbnail of a certain pdf for quick visualization. It is based on ImageMagick as it has all the function…☆17Updated 2 years ago
- Stanford Tregex-inspired language for rule-based dependency tree manipulation.☆21Updated 8 years ago
- Tools to manipulate and extract data from wikipedia dumps☆46Updated 11 years ago
- Python wrapper for Apache OpenNLP tools☆34Updated 8 years ago
- A python autocompletion library. Easycomplete has a simple API and utilizes google's autocomplete results & the english dictionary for no…☆40Updated 11 years ago
- A disk-based key/value store in Python with no dependencies.☆21Updated 10 years ago
- Efficiently search the most similar strings against the query in Python.☆18Updated last month
- Automatically exported from code.google.com/p/guess-language☆53Updated last year
- Repository for the CLiPS HAte speech DEtection System [HADES].☆24Updated 7 years ago
- Natural language generation language☆56Updated 6 years ago
- A Large Automatically-Constructed Resource of Predicate Paraphrases☆45Updated 5 years ago
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?)☆36Updated 10 years ago
- extract difference between two html pages☆32Updated 6 years ago
- This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet…☆29Updated 4 months ago
- A PyData 2013 talk on straightforward, data-driven ways to handle natural language text in Python.☆50Updated 10 years ago
- rapid nlp prototyping☆71Updated 2 years ago
- Non-Overlapping Aho-Corasick Python extension, for Python 2 (str and unicode) and Python 3☆51Updated 9 years ago
- Simple approximate-nearest-neighbours in Python using locality sensitive hashing.☆140Updated 12 years ago
- ☆52Updated last year
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆151Updated 4 months ago