haifengl / bigdata
Introduction to Big Data
☆392Updated 9 months ago
Alternatives and similar repositories for bigdata:
Users who are interested in bigdata are comparing it to the repositories listed below
- BigData Ecosystem Dataset☆576Updated 3 years ago
- This page is a summary to keep track of Hadoop-related projects, and relevant projects around the Big Data scene focused on the open sour…☆692Updated 4 years ago
- Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks☆363Updated 7 years ago
- Data-Intensive Text Processing with MapReduce☆623Updated 4 years ago
- A curated list of amazingly awesome Hadoop and Hadoop ecosystem resources☆1,094Updated 10 months ago
- Information for setting up for the BerkeleyX Spark Intro MOOC, and lab assignments for the course☆349Updated 3 years ago
- Diagrams describing Apache Hadoop internals (2.3.0 or later).☆431Updated 5 years ago
- Apache Spark™ and Scala Workshops☆263Updated 7 months ago
- Practical examples of using Apache Spark in several different use cases☆102Updated 8 years ago
- Kite SDK☆393Updated 2 years ago
- Source code for Big Data: Principles and best practices of scalable realtime data systems☆332Updated 9 months ago
- A curated list of awesome HBase projects and resources.☆172Updated 2 years ago
- Gallery of Apache Zeppelin notebooks☆215Updated 5 years ago
- Coding exercises for Apache Spark☆104Updated 9 years ago
- Examples for High Performance Spark☆506Updated 4 months ago
- The Internals of Spark Structured Streaming☆418Updated 2 years ago
- Examples for learning spark☆332Updated 9 years ago
- MapReduce, Spark, Java, and Scala for Data Algorithms Book☆1,071Updated 5 months ago
- A curated list of awesome Apache Spark packages and resources.☆1,764Updated 4 months ago
- Next-generation web analytics processing with Scala, Spark, and Parquet.☆331Updated 9 years ago
- Source, data and tutorials of the blog post video series of Hue, the Web UI for Hadoop.☆237Updated 8 years ago
- Literature Study☆165Updated 8 years ago
- Scala examples for learning to use Spark☆444Updated 4 years ago
- This tutorial provides a quick introduction to using Spark☆57Updated 8 years ago
- ☆54Updated 8 years ago
- Real Time Analytics and Data Pipelines based on Spark Streaming☆525Updated 5 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆470Updated 7 years ago
- A tool for monitoring and tuning Spark jobs for efficiency.☆357Updated 2 years ago
- The Internals of Apache Spark☆1,492Updated 5 months ago
- Examples of Spark 2.0☆211Updated 3 years ago