pauldix / working-with-big-data
Slides, code, and supplemental materials for the LiveLesson: Working with Big Data: Infrastructure, Algorithms, and Visualizations
☆55Updated 13 years ago
Alternatives and similar repositories for working-with-big-data
Users who are interested in working-with-big-data are comparing it to the libraries listed below
- A Seriously Fun guide to Big Data Analytics in Practice☆169Updated 10 years ago
- examples from "Thoughtful Machine Learning"☆128Updated 2 years ago
- ☆23Updated 10 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52Updated 9 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm☆21Updated 5 years ago
- Hubot + deep learning = image classification☆10Updated 10 years ago
- A demo of how to use PageRank with Hadoop and SociaLite to identify anomalies in Healthcare Data☆47Updated 10 years ago
- Example App for Elasticsearch Series☆55Updated 9 years ago
- A big data cluster management tool that creates and manages clusters of different technologies.☆21Updated 10 years ago
- ☆78Updated 3 years ago
- Vagrant, Apache Spark and Apache Zeppelin VM for teaching☆44Updated 8 years ago
- DataDuck ETL - the extract-transform-load framework for data warehousing☆60Updated 8 years ago
- Easy publishing with graph data included☆208Updated 9 years ago
- Human-Powered Data Analysis with Mechanical Turk☆300Updated 13 years ago
- A Machine Learning API with native redis caching and export + import using S3. Analyze entire datasets using an API for building, trainin…☆100Updated 3 years ago
- SQLAlchemy models and DDL and ERD generation from chop-dbhi/data-models style JSON endpoints.☆11Updated 2 years ago
- An efficient native implementation of the HyperLogLog cardinality estimator for Ruby☆36Updated 13 years ago
- Code and Presentation slides for Teaching the Elephant to Read☆17Updated 9 years ago
- CustomerML is an open source customer science platform leveraging the power of Predictiveworks and fully integrated with Elasticsearch an…☆48Updated 10 years ago
- A curated list of awesome Apache Spark packages and resources.☆40Updated 8 years ago
- Public code files for the DDL blog☆56Updated 7 years ago
- Code reference from my Qbox blog posts.☆87Updated 10 years ago
- Code required for the examples in Algorithms of the Intelligent Web, 2nd Edition☆27Updated 4 years ago
- ☆75Updated 10 years ago
- A ruby/c extension to Christian Borgelt's apriori item-set implementation☆55Updated 15 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 9 years ago
- A distributed bloom filter implementation based on redis☆40Updated 7 years ago
- big data books and papers☆30Updated 12 years ago
- End-to-end data science example running on Cloud Foundry☆19Updated 9 years ago
- Apache Zeppelin on Kubernetes.☆28Updated 6 years ago