pauldix / working-with-big-dataLinks
Slides, code, and supplemental materials for the LiveLesson: Working with Big Data: Infrastructure, Algorithms, and Visualizations
☆55Updated 13 years ago
Alternatives and similar repositories for working-with-big-data
Users that are interested in working-with-big-data are comparing it to the libraries listed below
Sorting:
- examples from "Thoughtful Machine Learning"☆128Updated 2 years ago
- A Seriously Fun guide to Big Data Analytics in Practice☆169Updated 10 years ago
- ☆23Updated 10 years ago
- Human-Powered Data Analysis with Mechanical Turk☆300Updated 13 years ago
- DataDuck ETL - the extract-transform-load framework for data warehousing☆60Updated 8 years ago
- An efficient native implementation of the HyperLogLog cardinality estimator for Ruby☆36Updated 13 years ago
- Benchmark of Rails and PostgreSQL JSON generation techniques☆33Updated 3 years ago
- A big data cluster management tool that creates and manages clusters of different technologies.☆21Updated 10 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52Updated 9 years ago
- SQLAlchemy models and DDL and ERD generation from chop-dbhi/data-models style JSON endpoints.☆11Updated 2 years ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine☆45Updated 6 years ago
- A Machine Learning API with native redis caching and export + import using S3. Analyze entire datasets using an API for building, trainin…☆100Updated 3 years ago
- Implementation of the Confluent Schema Registry API as a Rails application☆91Updated last week
- Example App for Elasticsearch Series☆55Updated 9 years ago
- CustomerML is an open source customer science platform leveraging the power of Predictiveworks and fully integrated with Elasticsearch an…☆48Updated 10 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm☆21Updated 5 years ago
- A simple library that makes it easy to create drip marketing campaigns (or anything that should be performed on an offset schedule)☆30Updated 12 years ago
- ☆19Updated 8 years ago
- A platform for real-time streaming search☆102Updated 9 years ago
- Hubot + deep learning = image classification☆10Updated 10 years ago
- VoltDB Click Stream Processing Example.☆16Updated 8 years ago
- A distributed bloom filter implementation based on redis☆40Updated 7 years ago
- Graph Processing Algorithms on top of Neo4j☆39Updated 8 years ago
- A Ruby toolkit for cloud-friendly ETL☆38Updated 9 years ago
- Reference App for Ad Analytics, using Ruby on Rails.☆79Updated 2 years ago
- A collection of examples illustrating data processing, data science, and machine learning on the Pivotal Greenplum and HAWQ MPP databases☆20Updated 9 years ago
- Framework for micro services☆79Updated 11 years ago
- Apache Zeppelin on Kubernetes.☆28Updated 6 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 9 years ago
- Rails engine to manage your team's own Technology Radar☆16Updated 4 years ago