dracos / double-metaphone
A Python implementation of the Double Metaphone algorithm
☆61Updated 13 years ago
Related projects: ⓘ
- ... just because nltk is too heavy☆36Updated 14 years ago
- Lightweight, multilingual natural language processing☆63Updated 11 years ago
- Python library for creating word clouds from text☆51Updated 5 years ago
- Updates to Zope's keyphrase extractor (forked from 1.1.0)☆67Updated 7 years ago
- Preprocess text for NLP (tokenizing, lowercasing, stemming, sentence splitting, etc.)☆29Updated 13 years ago
- A Django based search engine powered by CouchDB, celery and whoosh.☆49Updated 8 years ago
- Python wrapper for the Vowpal Wabbit machine learning library.☆53Updated 11 years ago
- A Python version (almost a port) of ProPublica's TableFu☆231Updated 11 years ago
- Pretty fast parser for probabilistic context free grammars☆86Updated 11 years ago
- Code for High Performance Computing tutorial for EuroPython 2011☆100Updated 3 years ago
- Python interface to Solr☆276Updated 7 months ago
- Crab - A recommendation engine library for Python☆87Updated 13 years ago
- The reference implementation of the SPEAR ranking algorithm in Python.☆37Updated 8 years ago
- A python interface to WolframAlpha.☆15Updated 14 years ago
- Social Graph Analysis using Elastic MapReduce and PyPy☆55Updated 13 years ago
- ☆17Updated 7 years ago
- A disk-based key/value store in Python with no dependencies.☆21Updated 9 years ago
- A visualizer for multi-dimensional semantic data☆38Updated 12 years ago
- Demo code for learning_text_transformer☆25Updated 9 years ago
- templatemaker is a Python library that can extract data from files with a similar format, like HTML pages.☆63Updated 3 years ago
- IMPORTANT: Data Brewery is now Bubbles: https://github.com/stiivi/bubbles This brewery repository is NOT MAINTAINED any more.☆134Updated 11 years ago
- Python bindings for Neo4j☆59Updated 11 years ago
- vbench: A tool for benchmarking your code through time, for showing performance improvement or regressions☆246Updated 6 years ago
- John Langford's original release of Vowpal Wabbit -- a fast online learning algorithm☆57Updated last month
- TweeQL is a Query Language for Tweets: SELECT brand(text) AS brand, sentiment(text) AS sentiment FROM twitter_sample;☆193Updated 10 years ago
- A platform for storing large semantic networks on MongoDB☆23Updated 13 years ago
- MapReduce platform in python☆34Updated 9 years ago
- Ultra simple API for geocoding a single string against various web services.☆183Updated 10 years ago
- [unmaintained] Python version of arc90's *older* readability.js☆47Updated 12 years ago
- Latent Dirichlet Allocation for topic modeling of streamed data sources☆102Updated 9 years ago