davedash / textcluster
Uses TF-IDF and inverted search to cluster search results
☆22 Updated 14 years ago
Alternatives and similar repositories for textcluster
Users that are interested in textcluster are comparing it to the libraries listed below
- A Python version (almost a port) of ProPublica's TableFu ☆231 Updated 11 years ago
- collection of modules to build distributed and reliable concurrent systems in Python. ☆205 Updated 11 years ago
- ... just because nltk is too heavy ☆35 Updated 15 years ago
- templatemaker is a Python library that can extract data from files with a similar format, like HTML pages. ☆63 Updated 4 years ago
- csvcat ☆22 Updated 9 years ago
- Data analysis tool. ☆85 Updated 2 years ago
- a python port of https://github.com/twitter/twitter-text-rb also available via `pip install twitter_text` ☆82 Updated 7 years ago
- Ultra simple API for geocoding a single string against various web services. ☆183 Updated 11 years ago
- A Python implementation of the Double Metaphone algorithm ☆61 Updated 14 years ago
- A Django based search engine powered by CouchDB, celery and whoosh. ☆49 Updated 9 years ago
- 🌆 TouristFriend API lets you query Google Places, Yelp and Foursquare at the same time, with Bayesian rankings! ☆29 Updated 6 years ago
- RGP -- Redis Graph via Python ☆30 Updated 10 years ago
- A simple demonstration of a GeoDjango application. ☆53 Updated 14 years ago
- Tool to visualize data quickly with no brain usage for plot creation ☆46 Updated 6 years ago
- Preprocess text for NLP (tokenizing, lowercasing, stemming, sentence splitting, etc.) ☆29 Updated 14 years ago
- [discontinued] Python interfaces to the Meetup Web API ☆110 Updated 6 years ago
- Command line utilities ☆32 Updated 11 years ago
- Non-plurk fork of Solace. Might eventually be merged back. Solace is a Stackoverflow inspired platform. ☆63 Updated 14 years ago
- Slinky, a high-performance web crawler / text analytics in Python, Redis, Hadoop, R, Gephi ☆41 Updated 15 years ago
- workflow support for reproducible deduplication and merging ☆16 Updated 2 years ago
- MapReduce platform in python ☆34 Updated 10 years ago
- High Level Kafka Scanner ☆19 Updated 7 years ago
- A weather monitoring Dashboard built upon Python and Yahoo API ☆14 Updated 10 years ago
- Convert URL's to a normalized unicode format ☆67 Updated 7 years ago
- Python interface to Solr ☆277 Updated last year
- TweeQL is a Query Language for Tweets: SELECT brand(text) AS brand, sentiment(text) AS sentiment FROM twitter_sample; ☆194 Updated 11 years ago
- Asynchronous webchat using Bottle and gevent ☆61 Updated 14 years ago
- Interactive Programming Notebook for the Web Browser ☆98 Updated 4 years ago
- Find which links on a web page are pagination links ☆29 Updated 8 years ago
- simple python datastructure wrappings for redis ☆105 Updated 4 years ago