Labs and data files for a full-day Spark workshop
☆25 · May 24, 2025 · Updated 9 months ago
Alternatives and similar repositories for spark-workshop
Users interested in spark-workshop are comparing it to the repositories listed below.
- ☆11 · Jun 23, 2022 · Updated 3 years ago
- Key-value Kafka Database ☆12 · Feb 18, 2026 · Updated 2 weeks ago
- A repository of the web page for VLDB2020 @ Tokyo ☆15 · Mar 12, 2022 · Updated 3 years ago
- Small, simple C# animation library built using the reactive extensions framework, utilizing Robert Penner's easing functions. Does not ma… ☆14 · Aug 3, 2015 · Updated 10 years ago
- Will store links to known evaluation datasets alongside stats to characterize them ☆24 · Mar 9, 2016 · Updated 9 years ago
- 500 Lines or Less ☆27 · Aug 15, 2014 · Updated 11 years ago
- A Web-Based Visualization Tool for Biclustering of Multivariate Time Series ☆10 · Feb 17, 2023 · Updated 3 years ago
- This RESTful API consumes data from a Wellsite Information Transfer Standard Markup Language (WITSML) server and provides responses in fo… ☆11 · Jul 31, 2022 · Updated 3 years ago
- ☆30 · Sep 27, 2017 · Updated 8 years ago
- A smart rebalancer written in bash ☆18 · Feb 12, 2024 · Updated 2 years ago
- Example of using Protractor with Cucumber and Page Objects ☆10 · Apr 12, 2017 · Updated 8 years ago
- ☆10 · Apr 14, 2023 · Updated 2 years ago
- ☆13 · Dec 28, 2018 · Updated 7 years ago
- Road to Continuous Upgrade ☆15 · Aug 12, 2025 · Updated 6 months ago
- A simple intrusion detection system that detects anomalous IP payloads, vertical and horizontal port scanning attacks in the selected net… ☆10 · Apr 16, 2018 · Updated 7 years ago
- Information relating to topics on Data Engineering, Data Infrastructure, Data Storing, Data Warehouses and Business Analysis. For those i… ☆10 · Aug 8, 2021 · Updated 4 years ago
- This repo hosts the data models for the Security components of OCF ☆11 · Oct 11, 2022 · Updated 3 years ago
- Sample time-series application using car telemetry as the use case ☆38 · Jun 27, 2025 · Updated 8 months ago
- An extended Bootstrap dropdown widget for Yii 2 with submenu drilldown. ☆15 · Jan 11, 2022 · Updated 4 years ago
- Tool useful to discover services behind unknown ports ☆14 · May 20, 2021 · Updated 4 years ago
- Python script to give you subsets of the nmap "top-ports". For example, I want the 10th to 100th most common TCP ports. Spits out a comma… ☆18 · Mar 8, 2020 · Updated 5 years ago
- Deep learning course materials ☆15 · Jun 24, 2020 · Updated 5 years ago
- GA Grid (Beta) is a distributive in memory Genetic Algorithm (GA) component for Apache Ignite. A GA is a method of solving complex optimi… ☆11 · Nov 14, 2017 · Updated 8 years ago
- A simple example of a PySpark-based project. ☆11 · Jun 3, 2016 · Updated 9 years ago
- Screening Meter Data Dissertation by Clayton Miller ☆11 · Jun 7, 2017 · Updated 8 years ago
- R package to analyze shot group data: shape, precision, and accuracy ☆10 · Aug 7, 2025 · Updated 6 months ago
- Dynamic Extensions for Network Objects ☆11 · Feb 3, 2026 · Updated last month
- Pintograph simulator in JavaScript ☆12 · Oct 23, 2025 · Updated 4 months ago
- phData Pulse application log aggregation and monitoring ☆13 · Apr 13, 2020 · Updated 5 years ago
- Bare minimum End-to-End ML application with Flask REST API Prediction Service ☆11 · Jul 11, 2020 · Updated 5 years ago
- Regional Energy Analyst, the first data-driven software for the analysis of the future energy consumption of buildings across sectors, ci… ☆15 · Jan 28, 2020 · Updated 6 years ago
- Xtremio Cinder Driver ☆10 · Jun 10, 2018 · Updated 7 years ago
- Files for the Defcon Toronto Introduction to 64-bit Linux Exploitation ☆15 · Feb 23, 2018 · Updated 8 years ago
- EMC ScaleIO PowerShell Toolkit ☆10 · Apr 13, 2016 · Updated 9 years ago
- Portfolio repository for work done in Springboard's Data Science Career Track ☆11 · Apr 1, 2019 · Updated 6 years ago
- A collection of data analysis projects done using PySpark via Jupyter notebooks. ☆10 · Oct 8, 2022 · Updated 3 years ago
- Extra functions for 'sf' Simple Features in R ☆13 · Jan 19, 2026 · Updated last month
- Shell intended for forwarding-only ssh connection via jumphost ☆11 · Jun 27, 2018 · Updated 7 years ago
- Very simple NoSQL database, created as an example during a talk. See FineDB for a real high-performance NoSQL database. ☆14 · Sep 29, 2013 · Updated 12 years ago