georgetown-analytics / product-classifier
Classify products into categories by their name with NLTK
☆28Updated 10 years ago
Alternatives and similar repositories for product-classifier
Users who are interested in product-classifier are comparing it to the libraries listed below.
- A Topic Modeling toolbox☆92Updated 9 years ago
- Simple multi-language Python and NLTK-based implementation of text summarization☆59Updated 6 years ago
- December 14th Python Meetup Files☆37Updated 12 years ago
- Resize image on the fly using flask, zappa, pillow, opencv-python☆18Updated 7 years ago
- Frontera backend to guide a crawl using PageRank, HITS or other ranking algorithms based on the link structure of the web graph, even whe…☆55Updated 11 months ago
- Restrict crawl and scraping scope using matchers.☆25Updated 8 years ago
- Sample repo for luigi tasks & config☆36Updated 8 years ago
- ☆22Updated 8 years ago
- Easy extraction of keywords and engines from search engine results pages (SERPs).☆90Updated 3 years ago
- E-commerce scraping and analytics platform.☆52Updated 9 years ago
- Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations☆40Updated 11 months ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc.☆58Updated 4 years ago
- Word2Vec models with Twitter data using Spark. Blog:☆65Updated 6 years ago
- Public Machine Learning and Data Competition Repo☆54Updated 9 years ago
- A Django based search engine powered by CouchDB, celery and whoosh.☆49Updated 9 years ago
- Modularly extensible semantic metadata validator☆84Updated 9 years ago
- Twitter visualization experiment using various Python-based technologies.☆60Updated 8 years ago
- High Level Kafka Scanner☆19Updated 7 years ago
- Search 'from' and 'to' strings to learn a text cleaning mapping☆17Updated 9 years ago
- A Django app to persist and retrieve scikit-learn machine learning models☆48Updated 2 years ago
- Small set of utilities to simplify writing Scrapy spiders.☆49Updated 9 years ago
- 🌆 TouristFriend API lets you query Google Places, Yelp and Foursquare at the same time, with Bayesian rankings!☆29Updated 6 years ago
- Reduction is a python script which automatically summarizes a text by extracting the sentences which are deemed to be most important.☆55Updated 10 years ago
- Automatic Item List Extraction☆87Updated 8 years ago
- Paginating the web☆37Updated 11 years ago
- FacetView is a pure javascript frontend for ElasticSearch.☆290Updated 10 years ago
- ☆41Updated 4 years ago
- Scraper for categories and lists on ecommerce and other listing websites☆42Updated 4 years ago
- A slim, non-SWIG Python adapter to CTesseract (Tesseract OCR for C).☆24Updated 11 years ago
- Dynamic data analysis over the web. The logic to your data dashboards.☆156Updated 10 years ago