georgetown-analytics / product-classifierLinks
Classify products into categories by their name with NLTK
☆28Updated 10 years ago
Alternatives and similar repositories for product-classifier
Users that are interested in product-classifier are comparing it to the libraries listed below
Sorting:
- Sample repo for luigi tasks & config☆36Updated 9 years ago
- A Topic Modeling toolbox☆92Updated 9 years ago
- Twitter visualizaton experiment using various python-based technologies.☆60Updated 8 years ago
- Automatic Item List Extraction☆87Updated 9 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…☆48Updated 3 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy"☆15Updated 8 years ago
- a scaleable and efficient crawelr with docker cluster , crawl million pages in 2 hours with a single machine☆97Updated last year
- A Machine Learning API with native redis caching and export + import using S3. Analyze entire datasets using an API for building, trainin…☆100Updated 2 years ago
- A fast python scikit-learn text sentiment API server.☆89Updated 9 years ago
- Frontera backend to guide a crawl using PageRank, HITS or other ranking algorithms based on the link structure of the web graph, even whe…☆55Updated last year
- A python tool for collecting tweets in mongoDB using the search API☆80Updated 2 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- My capstone project for Galvanize (Zipfian Academy)☆38Updated 6 years ago
- E-commerce scraping and analytics platform.☆53Updated 9 years ago
- A script to get summary of text content☆31Updated 8 years ago
- Resize image on the fly using flask, zappa, pillow, opencv-python☆18Updated 7 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- Generating the next read for our book club- with Data Science!☆40Updated 9 years ago
- Easy extraction of keywords and engines from search engine results pages (SERPs).☆90Updated 3 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc.☆58Updated 4 years ago
- Word2Vec models with Twitter data using Spark. Blog:☆65Updated 6 years ago
- [NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD]☆108Updated 12 years ago
- Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations☆40Updated last year
- Data analysis tool.☆85Updated 2 years ago
- PyTennessee 2014: Statistical Data Analysis in Python☆85Updated 10 years ago
- Slides to learn a little natural language processing (NLP) with Python. Written in reST with S5/Docutils.☆28Updated 12 years ago
- Multidimensional data explorer and visualization tool.☆56Updated 8 years ago
- Modularly extensible semantic metadata validator☆84Updated 9 years ago
- Deprecated. Formerly: scripts to make it easier to set up and manipulate clusters at Amazon EC2☆110Updated 12 years ago
- A couple projects using scikit-learn illustrating project decision making.☆15Updated 8 years ago