goldshtn / spark-workshopView external linksLinks
Labs and data files for a full-day Spark workshop
β24May 24, 2025Updated 8 months ago
Alternatives and similar repositories for spark-workshop
Users that are interested in spark-workshop are comparing it to the libraries listed below
Sorting:
- An innovative crop management system for farmers πΎ.β10Feb 22, 2018Updated 7 years ago
- β16Sep 4, 2024Updated last year
- A simple intrusion detection system that detects anomalous IP payloads, vertical and horizontal port scanning attacks in the selected netβ¦β10Apr 16, 2018Updated 7 years ago
- β10Apr 14, 2023Updated 2 years ago
- Road to Continous Upgradeβ15Aug 12, 2025Updated 6 months ago
- Example of using Protractor with Cucumber and Page Objectsβ10Apr 12, 2017Updated 8 years ago
- Sample time-series application using car telemetry as the use caseβ38Jun 27, 2025Updated 7 months ago
- A quick transform between rgb and ciecam02 color model.β11Sep 20, 2018Updated 7 years ago
- Tool useful to discover services behind unknown portsβ14May 20, 2021Updated 4 years ago
- Regional Energy Analyst, the first data-driven software for the analysis of the future energy consumption of buildings across sectors, ciβ¦β15Jan 28, 2020Updated 6 years ago
- A collection of data analysis projects done using PySpark via Jupyter notebooks.β10Oct 8, 2022Updated 3 years ago
- An open source project on estimating train delays in India.β11Oct 29, 2018Updated 7 years ago
- Live-Armor: Building Custom Linux Live Images for Security Sandboxingβ11Mar 25, 2015Updated 10 years ago
- R package to analyze shot group data: shape, precision, and accuracyβ10Aug 7, 2025Updated 6 months ago
- GA Grid (Beta) is a distributive in memory Genetic Algorithm (GA) component for Apache Ignite. A GA is a method of solving complex optimiβ¦β11Nov 14, 2017Updated 8 years ago
- Pintograph simulator in Javascriptβ12Oct 23, 2025Updated 3 months ago
- Python script to give you subsets of the nmap "top-ports". For example, I want the 10th to 100th most common TCP ports. Spits out a commaβ¦β18Mar 8, 2020Updated 5 years ago
- Portfolio repository for work done in Springboard's Data Science Career Trackβ11Apr 1, 2019Updated 6 years ago
- phData Pulse application log aggregation and monitoringβ13Apr 13, 2020Updated 5 years ago
- Xtremio Cinder Driverβ10Jun 10, 2018Updated 7 years ago
- Files for the Defcon Toronto Introduction to 64-bit Linux Exploitationβ15Feb 23, 2018Updated 7 years ago
- Dynamic Extensions for Network Objectsβ11Feb 3, 2026Updated last week
- β12Jan 12, 2023Updated 3 years ago
- Gitstats application for OpenCPUβ12May 13, 2024Updated last year
- Mirror of Apache Kafka without ZooKeeper dependencyβ11Feb 4, 2019Updated 7 years ago
- Socks5 proxy server by golangβ11Oct 10, 2019Updated 6 years ago
- A custom AWS credential provider that allows your Hadoop or Spark application access S3 file system by assuming a roleβ10Jan 9, 2026Updated last month
- β11Mar 28, 2023Updated 2 years ago
- IoT Trucking App with Flink (with Table API & SQL)β14Jul 4, 2018Updated 7 years ago
- β10Jun 22, 2025Updated 7 months ago
- R package to get weather data using OpenWeatherMap APIβ13May 12, 2016Updated 9 years ago
- A Redis Publish/Subscribe NATS Connectorβ13Apr 14, 2023Updated 2 years ago
- Python library for writing Compute Modulesβ13Jan 29, 2026Updated 2 weeks ago
- Transfer entropy (conditional mutual information) estimators for the Julia languageβ14Nov 6, 2022Updated 3 years ago
- Pastenum is a text dump enumeration tool.β14Dec 9, 2013Updated 12 years ago
- A PoC that uses the DirSync protocol to poll Active Directory for changesβ13Aug 16, 2020Updated 5 years ago
- Returns a list of all Public IP addresses being used by your AWS account. You can configure which regions you want to query.β14Jun 7, 2020Updated 5 years ago
- Turn KML Files into tidy data frames:β12Jan 1, 2017Updated 9 years ago
- Hugecast - The Off-Heap Storage for Hazelcastβ22Dec 1, 2013Updated 12 years ago