zoharsan / RetailAnalytics
Hortonworks Data Platform Retail Analytics Demo
☆13 · Updated 8 years ago
Alternatives and similar repositories for RetailAnalytics:
Users who are interested in RetailAnalytics are comparing it to the repositories listed below.
- Mastering Spark for Data Science, published by Packt ☆47 · Updated 2 years ago
- Source code for 'PySpark Recipes' by Raju Kumar Mishra ☆25 · Updated 5 years ago
- AWS Big Data Certification ☆25 · Updated 2 months ago
- 3NF-normalize Yelp data on S3 with Spark and load it into Redshift - automate the whole thing with Apache Airflow ☆12 · Updated 5 years ago
- Terraform script for launching multiple EMR clusters for training purposes. ☆16 · Updated last year
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered… ☆16 · Updated 5 years ago
- Labs and data files for a full-day Spark workshop ☆24 · Updated last year
- Master complex big data processing, stream analytics, and machine learning with Apache Spark ☆18 · Updated 2 years ago
- Source code for 'Pro Hadoop Data Analytics' by Kerry Koitzsch ☆14 · Updated last year
- Supplementary material for Building a Modern Data Platform with Snowflake, from Pearson. ☆21 · Updated 3 years ago
- A curated list of awesome Databricks resources, including Spark ☆17 · Updated 9 months ago
- Apache Hadoop 3 Quick Start Guide, published by Packt ☆14 · Updated last year
- Big Data Demystified meetup and blog examples ☆31 · Updated 7 months ago
- Examples for High Performance Spark ☆15 · Updated 4 months ago
- The open source version of the Amazon Redshift Cluster Management Guide. ☆48 · Updated last year
- Projects from my Hadoop training sessions ☆17 · Updated 7 years ago
- Kirk's Zeppelin Notebooks ☆12 · Updated 6 years ago
- Spark NLP for Streamlit ☆15 · Updated 3 years ago
- Course content for Practical AI on the Google Cloud Platform ☆10 · Updated 4 years ago
- Cassandra + Spark = ❤️ Machine Learning with Apache Spark & Cassandra ☆20 · Updated 3 years ago
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing… ☆45 · Updated 5 months ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark ☆16 · Updated last year
- An example PySpark project with pytest ☆17 · Updated 7 years ago
- Spark in Kaggle competitions ☆9 · Updated 9 years ago
- Big Data Architect’s Handbook, published by Packt ☆20 · Updated 2 years ago
- Apache Spark in 7 Days [Video], by Packt Publishing ☆18 · Updated 2 years ago