Labs and data files for a full-day Spark workshop
☆25 · May 24, 2025 · Updated last year
Alternatives and similar repositories for spark-workshop
Users who are interested in spark-workshop are comparing it to the repositories listed below.
- Hands-On DevOps with Ansible [Video], Published By Packt ☆14 · Jan 15, 2021 · Updated 5 years ago
- A generator for synthetic streams of financial transactions. ☆16 · Feb 3, 2014 · Updated 12 years ago
- A repository of the web page for VLDB2020 @ Tokyo ☆15 · Mar 12, 2022 · Updated 4 years ago
- Apache Spark programming exercises with Python ☆13 · Apr 18, 2021 · Updated 5 years ago
- Docker files for the example code in Big Data for Chimps ☆20 · May 19, 2015 · Updated 11 years ago
- An innovative crop management system for farmers 🌾. ☆10 · Feb 22, 2018 · Updated 8 years ago
- EcoEpi shinyApps ☆18 · Aug 26, 2019 · Updated 6 years ago
- This repository contains the code and hyper-parameters for the paper: "Predicting taxi-passenger demand using streaming data, L. Moreira… ☆13 · Jul 10, 2017 · Updated 8 years ago
- An open source project on estimating train delays in India. ☆11 · Oct 29, 2018 · Updated 7 years ago
- An explainable Deep Machine Vision framework for Plant Stress Phenotyping ☆15 · Jun 29, 2021 · Updated 4 years ago
- Regional Energy Analyst, the first data-driven software for the analysis of the future energy consumption of buildings across sectors, ci… ☆15 · Jan 28, 2020 · Updated 6 years ago
- A simple example for PySpark based project. ☆11 · Jun 3, 2016 · Updated 10 years ago
- A repo for tracking work regarding the Brigade Organizer's Playbook ☆10 · Jun 18, 2024 · Updated last year
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution ☆19 · Aug 16, 2020 · Updated 5 years ago
- Ebook for Data Scientist, Machine Learning, Deep Learning ☆11 · Mar 16, 2021 · Updated 5 years ago
- Ensemble Learning Techniques Tutorial with Credit Card Fraud ☆10 · Oct 22, 2017 · Updated 8 years ago
- Copy-on-write fork()-like memory dump using Process Snapshotting APIs ☆13 · Jul 23, 2017 · Updated 8 years ago
- Analyzing plant disease epidemics ☆16 · Jun 18, 2024 · Updated last year
- OBSOLETE: Prototype Neo4j Knowledge Graph for Coronavirus outbreaks (see NEW VERSION: https://github.com/covid-19-net/covid-19-community) ☆18 · Nov 25, 2020 · Updated 5 years ago
- Python package for calculating Mahalanobis distances from NumPy arrays ☆14 · Jun 22, 2022 · Updated 3 years ago
- Source Code for 'Machine Learning Using R, Second Edition' by Karthik Ramasubramanian and Abhishek Singh ☆17 · Feb 7, 2019 · Updated 7 years ago
- Docker Nginx Image w/LDAP Authentication, Zabbix agent monitoring, S6 init, logrotate based on Alpine ☆13 · Dec 16, 2022 · Updated 3 years ago
- ☆10 · Oct 16, 2015 · Updated 10 years ago
- Revolutionizing Design: ChatGPT's Role in Next-Generation Software Architecture ☆12 · Jan 24, 2024 · Updated 2 years ago
- A Web-Based Visualization Tool for Biclustering of Multivariate Time Series ☆10 · Feb 17, 2023 · Updated 3 years ago
- ☆10 · Jan 4, 2019 · Updated 7 years ago
- Presentations from the 2023 Fellowship. ☆13 · Jan 31, 2024 · Updated 2 years ago
- Screening Meter Data Dissertation by Clayton Miller ☆11 · Jun 7, 2017 · Updated 9 years ago
- Predicting demand for public transportation in Nairobi ☆14 · Oct 5, 2018 · Updated 7 years ago
- This R package provides the tools to perform standard and robust wavelet variance analysis for time series (signal processing). Among ot… ☆17 · Oct 8, 2025 · Updated 8 months ago
- A repo that summarizes how facial recognition works. Check out the Jupyter notebook ☆15 · Aug 16, 2022 · Updated 3 years ago
- Mono.Cecil object model explorer ☆13 · Sep 1, 2016 · Updated 9 years ago
- Generate waveform JSON information from audio files, compatible with http://waveformjs.org/ ☆17 · Apr 28, 2021 · Updated 5 years ago
- Jakarta Connectors ☆14 · Jun 2, 2026 · Updated last week
- Maven overlay for local CAS development and testing ☆18 · Aug 30, 2016 · Updated 9 years ago
- This repository contains the code for a workshop on Generative AI using Gemini. ☆11 · Jul 22, 2024 · Updated last year
- Train Time Delay Prediction using machine learning ☆20 · Mar 27, 2024 · Updated 2 years ago
- Bare minimum End-to-End ML application with Flask REST API Prediction Service ☆11 · Jul 11, 2020 · Updated 5 years ago
- Effective Approaches for Time Series Anomaly Detection ☆11 · Jun 6, 2020 · Updated 6 years ago