PeachstoneIO / peachboxLinks
Python based data warehouse solution for the Lambda Architecture.
☆14Updated 10 years ago
Alternatives and similar repositories for peachbox
Users that are interested in peachbox are comparing it to the libraries listed below
Sorting:
- A platform for real-time streaming search☆102Updated 9 years ago
- Data science repo to help others☆12Updated 9 years ago
- A Topic Modeling toolbox☆92Updated 9 years ago
- Collects multimedia content shared through social networks.☆19Updated 10 years ago
- Apache Nutch fork tunned for web services and data discovery.☆10Updated 10 years ago
- Text classification using Naive Bayes and Elasticsearch☆154Updated 9 years ago
- Code reference from my Qbox blog posts.☆87Updated 10 years ago
- Utilities and examples to asssist in working with PySpark and Cassandra.☆36Updated 10 years ago
- Topic Modeling the Sarah Palin emails.☆34Updated 14 years ago
- Spark GCE Script Helps you deploy Spark cluster on Google Cloud.☆43Updated 10 years ago
- Fast, easy and intuitive machine learning prototyping.☆124Updated 11 years ago
- An implementation of the multi-armed bandit optimization pattern as a Flask extension☆81Updated 2 weeks ago
- Tools for writing, submitting, debugging, and monitoring Storm topologies in pure Python☆246Updated 2 years ago
- A Python wrapper for MADlib(http://madlib.net) - an open source library for scalable in-database machine learning algorithms☆63Updated 4 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy"☆15Updated 9 years ago
- Recommendations Serving Engine using python☆28Updated 10 years ago
- PredictionIO word2vec engine template (Scala-based parallelized engine)☆12Updated 10 years ago
- Python binding for gumbo-parser using Cython☆14Updated 9 years ago
- Load a linkedin network w/ python py2neo into a neo4j database, serve it via node.js, and display it w/ sigma.js☆29Updated 12 years ago
- Natural Language Processing with Spark's MLlib☆62Updated 8 years ago
- ☆146Updated 9 years ago
- Seldon Spark Jobs☆26Updated 10 years ago
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.☆38Updated 7 years ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine☆45Updated 6 years ago
- Latent Dirichlet Allocation for topic modeling of streamed data sources☆100Updated 10 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52Updated 9 years ago
- PredictionIO Classification Engine Template (Scala-based parallelized engine)☆39Updated 6 years ago
- Task Orchestration Tool Based on SWF and boto3☆38Updated 7 years ago
- Topic modeling web application☆40Updated 10 years ago
- A demo of how to use PageRank with Hadoop and SociaLite to identify anomalies in Healthcare Data☆47Updated 9 years ago