Workshop for Spark and Databricks
☆55Dec 6, 2019Updated 6 years ago
Alternatives and similar repositories for spark-saturday
Users who are interested in spark-saturday are comparing it to the libraries listed below.
- A set of example build and release pipelines for deploying Python and Scala to Azure Databricks and HDInsight☆14Jun 4, 2020Updated 5 years ago
- ☆16Jun 27, 2020Updated 5 years ago
- low-level helpers for Apache Spark libraries and tests☆16Dec 29, 2018Updated 7 years ago
- notebooks for nlp-on-spark☆13Jan 27, 2017Updated 9 years ago
- ☆31Sep 15, 2019Updated 6 years ago
- Spark stream from kafka(json) to s3(parquet)☆15Nov 8, 2018Updated 7 years ago
- Spark self-study handbook covering Spark Core, Spark SQL, Spark Streaming, spark-kafka, and delta-lake, plus basic Scala exercises and source-code analyses (e.g. master, shuffle), summaries, and translations.☆18Jul 19, 2023Updated 2 years ago
- Intermediate Machine Learning with Scikit-learn, 4h interactive workshop☆131Apr 20, 2021Updated 5 years ago
- Template to deploy Synapse Analytics using best practices to deliver a proof of concept.☆21Mar 3, 2023Updated 3 years ago
- High performance HBase / Spark SQL engine☆28Jul 7, 2022Updated 3 years ago
- These are some code examples☆56Jan 12, 2020Updated 6 years ago
- Using Apache Airflow to author, run and monitor complex data pipelines.☆12Oct 24, 2018Updated 7 years ago
- ☆14Apr 6, 2023Updated 3 years ago
- Simulation of regular login activity on a site and random activity from a hacker using a brute-force password guessing attack.☆15Mar 17, 2023Updated 3 years ago
- MLflow samples - deprecated☆22May 9, 2023Updated 3 years ago
- Automatic summarization is the process of shortening a text document with software, in order to create a summary with the major points of…☆16Sep 15, 2018Updated 7 years ago
- Mastering Spark for Data Science, published by Packt☆50Apr 22, 2026Updated 2 weeks ago
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase☆14Mar 23, 2016Updated 10 years ago
- Simulation of job offers and CVs with real-time processing, classification, and analytics using Kafka, Ray, Spark, and Databricks. Includ…☆14Dec 25, 2024Updated last year
- ☆12Aug 5, 2024Updated last year
- Ansible scripts for deploying Kafka on EC2☆10Oct 7, 2016Updated 9 years ago
- A workshop on how to build a fullstack web application using AWS Amplify for university students.☆11Sep 6, 2020Updated 5 years ago
- Advanced Machine Learning with Scikit-learn part I☆144Apr 17, 2020Updated 6 years ago
- Content for Meetups for DATA & AI - Microsoft DFW☆41May 6, 2019Updated 7 years ago
- The data product processor is a library for dynamically creating and executing Apache Spark Jobs based on a declarative description of a …☆13Apr 25, 2024Updated 2 years ago
- CentOS based Docker container for Time Series Analysis and Modeling.☆22Sep 19, 2019Updated 6 years ago
- Spark on Docker Swarm example code☆11Nov 27, 2016Updated 9 years ago
- MOVED TO https://github.com/jezdez/django-avatar☆15Oct 8, 2012Updated 13 years ago
- My HackerRank Solutions : https://www.hackerrank.com/RohanKhude☆12Jul 13, 2016Updated 9 years ago
- ☆20Apr 27, 2012Updated 14 years ago
- Customized Spark processor on NiFi☆15Dec 4, 2015Updated 10 years ago
- Playing with Financial Time Series☆27May 6, 2019Updated 7 years ago
- Applying GANs in improving question generation and answering☆12Oct 1, 2017Updated 8 years ago
- An IPython Notebook reader for Pelican☆10May 13, 2015Updated 10 years ago
- The Generative AI Assets Catalog is an organized repository designed for individuals seeking to explore the newest content released by AW…☆16Aug 22, 2025Updated 8 months ago
- ☆20Updated this week
- ☆16Jun 14, 2023Updated 2 years ago
- Tools and services for evaluating topic models☆15Apr 12, 2016Updated 10 years ago
- ☆12Aug 14, 2024Updated last year