techsuppdiva/spark-cheat-sheets

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/techsuppdiva/spark-cheat-sheets)

techsuppdiva / spark-cheat-sheets

This repo stores my Spark Tutorial slides.

☆15

Alternatives and similar repositories for spark-cheat-sheets

Users that are interested in spark-cheat-sheets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

KunihikoKido / fabric-aws-lambda
View on GitHub
☆10Jun 28, 2017Updated 9 years ago
Azure-Samples / hdinsight-spark-scala-kafka
View on GitHub
A basic example of how to read and write streaming data using Apache Spark and Kafka on HDInsight
☆13Mar 2, 2023Updated 3 years ago
sohrab- / microservice-simple-example
View on GitHub
Example microservice for Sixtree blog post
☆14Mar 30, 2016Updated 10 years ago
xetys / microxchng-workshop
View on GitHub
☆13Feb 16, 2017Updated 9 years ago
mitre / callisto
View on GitHub
☆16Feb 5, 2014Updated 12 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
steveloughran / zero-rename-committer
View on GitHub
Paper: A Zero-rename committer for object stores
☆20Nov 7, 2025Updated 8 months ago
Azure-Samples / data-factory-r-server-apache-spark-pipeline
View on GitHub
This tutorial highlights how to build a scalable machine-learning based data processing pipeline using Microsoft R Server with Apache Spa…
☆17Oct 6, 2016Updated 9 years ago
mrm1001 / spark_tutorial
View on GitHub
Code for the Spark tutorial at the Pydata conference in London June 2015
☆12Oct 9, 2016Updated 9 years ago
maropu / datasketches-spark
View on GitHub
Data Sketches for Apache Spark
☆22Dec 22, 2022Updated 3 years ago
julianser / hred-latent-piecewise
View on GitHub
☆19Aug 29, 2018Updated 7 years ago
salesforce / carbonj
View on GitHub
CarbonJ - A high-performance drop-in replacement to carbon-relay and carbon-cache
☆28Jul 1, 2026Updated 3 weeks ago
scalding-io / social-media-analytics
View on GitHub
Social Media Data Mining and Analytics - HyperLogLog, BloomFilter and CountMinSketch with Scalding & Algebird
☆27Oct 6, 2018Updated 7 years ago
dyatlov / Expedia-PHP-API
View on GitHub
PHP Wrapper for Expedia API
☆21Mar 6, 2014Updated 12 years ago
steven-matison / dfhz_hdp_mpack
View on GitHub
Install Ambari 2.7.5 with HDP 3.1.4 without using Hortonworks repositories.
☆49Oct 1, 2021Updated 4 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
yangcvo / Zabbix-Monitoring-Kafka
View on GitHub
Zabbix-Monitoring Kafka集群 Brokers服务,Kafka Consumer Monitoring
☆11Jun 7, 2017Updated 9 years ago
DelaramGlp / airo
View on GitHub
AI risk ontology
☆25Aug 1, 2025Updated 11 months ago
AdamPaternostro / Azure-Databricks-Log4J-To-AppInsights
View on GitHub
Connect your Spark Databricks clusters Log4J output to the Application Insights Appender
☆19Aug 4, 2020Updated 5 years ago
rjagerman / glintlda
View on GitHub
Scalable Distributed LDA implementation for Spark & Glint
☆29Sep 27, 2016Updated 9 years ago
jbellis / YCSB
View on GitHub
Yahoo! Cloud Serving Benchmark
☆20Jul 20, 2015Updated 11 years ago
hoch / motw-2015
View on GitHub
Boilerplate project for MOTW Workshop 2015
☆10Mar 3, 2016Updated 10 years ago
aws / amazon-neptune-sigv4-signer
View on GitHub
A library for Amazon Neptune that enables AWS Signature Version 4 signing for HTTP using Netty.
☆18May 28, 2026Updated last month
monksy / awesome-data-engineering
View on GitHub
A curated list of data engineering tools for software developers
☆13Jan 8, 2019Updated 7 years ago
ZackButcher / istio-workshop
View on GitHub
Istio Workshop
☆19Dec 15, 2017Updated 8 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
trib3 / leakycauldron
View on GitHub
☆10Updated this week
tambo-labs / tambo-demo-db-thing
View on GitHub
☆18Jul 10, 2026Updated last week
timshenkao / StringKernelSVM
View on GitHub
Implementation of string kernel approach for SVM
☆25Jun 17, 2013Updated 13 years ago
mazko / ESJava
View on GitHub
Java => ES6 Transpiler
☆16Apr 23, 2016Updated 10 years ago
awesome-spark / spark-gotchas
View on GitHub
Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks
☆359Jun 6, 2017Updated 9 years ago
SainTechnologySolutions / allprogrammingtutorials
View on GitHub
☆16Nov 10, 2018Updated 7 years ago
mischavandenburg / az-104-azure-administrator
View on GitHub
Study notes and resources for the AZ-104 Azure Administrator exam and certification
☆22Jan 6, 2023Updated 3 years ago
qipeng / convolutionalRBM.m
View on GitHub
A MATLAB / MEX / CUDA-MEX implementation of Convolutional Restricted Boltzmann Machines.
☆25Dec 28, 2020Updated 5 years ago
mayur2810 / sope
View on GitHub
Apache Spark ETL Utilities
☆40Oct 23, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
kedify / otel-add-on
View on GitHub
Bridge between OTel and KEDA api
☆14Jul 10, 2026Updated last week
kbastani / oreilly-building-microservices-training
View on GitHub
Repository for building microservices training
☆18Mar 3, 2017Updated 9 years ago
michaldudak / pintograph
View on GitHub
Pintograph simulator in Javascript
☆12Jul 14, 2026Updated last week
bmuschko / todo
View on GitHub
A sample To Do web application built with Gradle.
☆33May 10, 2017Updated 9 years ago
quantiply / grafana-druid-wikipedia
View on GitHub
Example using Grafana with Druid
☆11Mar 27, 2015Updated 11 years ago
oovm / jupyter-protocol
View on GitHub
Jupyter Kernel Protocol for rust
☆14May 5, 2026Updated 2 months ago
maropu / spark-sql-flow-plugin
View on GitHub
Visualize column-level data lineage in Spark SQL
☆92May 13, 2022Updated 4 years ago