davedash / textcluster
Uses TF-IDF and inverted search to cluster search results
☆22 · Updated 14 years ago
Alternatives and similar repositories for textcluster
Users that are interested in textcluster are comparing it to the libraries listed below
- A Python version (almost a port) of ProPublica's TableFu ☆230 · Updated 12 years ago
- ... just because nltk is too heavy ☆35 · Updated 15 years ago
- A Django based search engine powered by CouchDB, celery and whoosh. ☆49 · Updated 9 years ago
- collection of modules to build distributed and reliable concurrent systems in Python. ☆206 · Updated 12 years ago
- templatemaker is a Python library that can extract data from files with a similar format, like HTML pages. ☆63 · Updated 5 years ago
- Non-plurk fork of Solace. Might eventually be merged back. Solace is a Stackoverflow inspired platform. ☆63 · Updated 14 years ago
- RGP -- Redis Graph via Python ☆30 · Updated 10 years ago
- csvcat ☆22 · Updated 9 years ago
- a python port of https://github.com/twitter/twitter-text-rb also available via `pip install twitter_text` ☆82 · Updated 7 years ago
- Slinky, a high-performance web crawler / text analytics in Python, Redis, Hadoop, R, Gephi ☆41 · Updated 15 years ago
- Python library with common functionality for writing web scrapers ☆102 · Updated 10 years ago
- A DSL to build Lucene text queries in Python. ☆38 · Updated 8 years ago
- Data analysis tool. ☆85 · Updated 2 years ago
- A friendlier interface to `socket`. ☆14 · Updated 10 years ago
- The main goal of this Python module is to provide functions for text classification. ☆10 · Updated 9 years ago
- A high-performance distributed web crawling & scraping framework written with golang and python. ☆30 · Updated 9 years ago
- A python library for foursquare API ☆47 · Updated 13 years ago
- Find which links on a web page are pagination links ☆29 · Updated 8 years ago
- A Python implementation of the Double Metaphone algorithm ☆61 · Updated 14 years ago
- Preprocess text for NLP (tokenizing, lowercasing, stemming, sentence splitting, etc.) ☆29 · Updated 14 years ago
- MapReduce platform in python ☆34 · Updated 10 years ago
- Interactive Programming Notebook for the Web Browser ☆98 · Updated 5 years ago
- Small set of utilities to simplify writing Scrapy spiders. ☆49 · Updated 10 years ago
- Convert URLs to a normalized unicode format ☆67 · Updated 7 years ago
- INTERVAL field for PostgreSQL (and an approximation for other backends) ☆21 · Updated 2 years ago
- A utility for easily creating and releasing Python packages ☆50 · Updated 5 years ago
- Definitions of Python jargon to help Python beginners understand Pythonista gobbledygook ☆55 · Updated 5 years ago
- Plots various graphs for a series of plaintext files in a directory ☆19 · Updated 9 years ago
- 🌆 TouristFriend API lets you query Google Places, Yelp and Foursquare at the same time, with Bayesian rankings! ☆29 · Updated 6 years ago
- Manage uploaded documents (pdfs) with backend cloud processing of the pdfs into individual pngs per page ☆103 · Updated 11 years ago