goldshtn / spark-workshop
Labs and data files for a full-day Spark workshop
☆24 Updated last month
Alternatives and similar repositories for spark-workshop
Users who are interested in spark-workshop are comparing it to the repositories listed below.
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S… ☆67 Updated 9 years ago
- Code examples and docker environment for Spark ☆27 Updated 9 years ago
- Source code for 'PySpark Recipes' by Raju Kumar Mishra ☆25 Updated 5 years ago
- Code and setup information for Introduction to Machine Learning with Spark ☆12 Updated 9 years ago
- A curated list of awesome Apache Spark packages and resources. ☆40 Updated 8 years ago
- Basic getting started with Kafka examples ☆47 Updated 6 years ago
- AWS Big Data Certification ☆25 Updated 5 months ago
- Examples for Fast Data Processing with Spark ☆59 Updated 11 years ago
- Geo-Located Data: Extracting Patterns from Mobile Data using Scikit-Learn and Cassandra ☆29 Updated 7 years ago
- Mastering Spark for Data Science, published by Packt ☆47 Updated 2 years ago
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered… ☆16 Updated 6 years ago
- Real-world Spark pipelines examples ☆83 Updated 7 years ago
- Spark with Scala example projects ☆34 Updated 6 years ago
- A collection of examples to help show different ways to manage state in Apache Flink ☆27 Updated 6 years ago
- Examples To Help You Learn Apache Spark ☆77 Updated 6 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin ☆52 Updated 9 years ago
- Anomaly Detection model uses Spark for training and Spark Streaming for testing ☆67 Updated 9 years ago
- Sample Spark Code ☆91 Updated 6 years ago
- These are some code examples ☆55 Updated 5 years ago
- An example PySpark project with pytest ☆16 Updated 7 years ago
- Use Kafka and Apache Spark streaming to perform click stream analytics ☆76 Updated 5 years ago
- ☆37 Updated 3 weeks ago
- ☆24 Updated 9 years ago
- notebooks for nlp-on-spark ☆13 Updated 8 years ago
- personal cheatsheets on various technologies ☆25 Updated 8 years ago
- real time log event processing using spark, kafka & cassandra ☆13 Updated 10 years ago
- Vagrant, Apache Spark and Apache Zeppelin VM for teaching ☆44 Updated 7 years ago
- Code for Tutorial on designing clickstream analytics application using Hadoop ☆54 Updated 10 years ago
- Some class materials for a data processing course using PySpark ☆52 Updated 2 years ago
- ☆38 Updated 7 years ago