This repo stores my Spark Tutorial slides.
☆15Feb 8, 2016Updated 10 years ago
Alternatives and similar repositories for spark-cheat-sheets
Users that are interested in spark-cheat-sheets are comparing it to the libraries listed below
Sorting:
- ☆10Jun 28, 2017Updated 8 years ago
- A basic example of how to read and write streaming data using Apache Spark and Kafka on HDInsight☆13Mar 2, 2023Updated 3 years ago
- ☆13Feb 16, 2017Updated 9 years ago
- Example microservice for Sixtree blog post☆14Mar 30, 2016Updated 9 years ago
- Paper: A Zero-rename committer for object stores☆20Nov 7, 2025Updated 4 months ago
- AI risk ontology☆23Aug 1, 2025Updated 7 months ago
- ☆19Aug 29, 2018Updated 7 years ago
- This tutorial highlights how to build a scalable machine-learning based data processing pipeline using Microsoft R Server with Apache Spa…☆16Oct 6, 2016Updated 9 years ago
- Data Sketches for Apache Spark☆22Dec 22, 2022Updated 3 years ago
- Social Media Data Mining and Analytics - HyperLogLog, BloomFilter and CountMinSketch with Scalding & Algebird☆27Oct 6, 2018Updated 7 years ago
- Study notes and resources for the AZ-104 Azure Administrator exam and certification☆17Jan 6, 2023Updated 3 years ago
- Yahoo! Cloud Serving Benchmark☆20Jul 20, 2015Updated 10 years ago
- CarbonJ - A high-performance drop-in replacement to carbon-relay and carbon-cache☆27Mar 4, 2026Updated 2 weeks ago
- Code for the Spark tutorial at the Pydata conference in London June 2015☆12Oct 9, 2016Updated 9 years ago
- PHP Wrapper for Expedia API☆21Mar 6, 2014Updated 12 years ago
- Zabbix-Monitoring Kafka集群 Brokers服务,Kafka Consumer Monitoring☆11Jun 7, 2017Updated 8 years ago
- A sample To Do web application built with Gradle.☆33May 10, 2017Updated 8 years ago
- Scalable Distributed LDA implementation for Spark & Glint☆29Sep 27, 2016Updated 9 years ago
- Contains sample code for a lightning talk on HBase.☆39Oct 13, 2020Updated 5 years ago
- Gibbs sampler 4 the Hierarchical Dirichlet Process☆37Aug 16, 2015Updated 10 years ago
- Connect your Spark Databricks clusters Log4J output to the Application Insights Appender☆19Aug 4, 2020Updated 5 years ago
- Install Ambari 2.7.5 with HDP 3.1.4 without using Hortonworks repositories.☆49Oct 1, 2021Updated 4 years ago
- A curated list of data engineering tools for software developers☆13Jan 8, 2019Updated 7 years ago
- Boilerplate project for MOTW Workshop 2015☆10Mar 3, 2016Updated 10 years ago
- A library for Amazon Neptune that enables AWS Signature Version 4 signing for HTTP using Netty.☆17Oct 21, 2025Updated 5 months ago
- Java => ES6 Transpiler☆16Apr 23, 2016Updated 9 years ago
- Istio Workshop☆19Dec 15, 2017Updated 8 years ago
- ☆16Nov 10, 2018Updated 7 years ago
- A sample agentic ai platform to run agentic workflows on AWS using either EKS or Bedrock AgentCore with open source frameworks like LangC…☆81Mar 14, 2026Updated last week
- ☆44Oct 2, 2012Updated 13 years ago
- Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks☆360Jun 6, 2017Updated 8 years ago
- Javascript library to talk to multiple OLAP backends from multiple frontends☆17Feb 4, 2013Updated 13 years ago
- Repository for building microservices training☆18Mar 3, 2017Updated 9 years ago
- Pprof serves via its HTTP server fprof profiling data in the format expected by the pprof visualization tools for Elixir.☆14Jan 10, 2023Updated 3 years ago
- Bridge between OTel and KEDA api☆12Updated this week
- Apache Spark ETL Utilities☆39Oct 23, 2024Updated last year
- Jupyter Kernel Protocol for rust☆14Mar 11, 2026Updated last week
- Example using Grafana with Druid☆11Mar 27, 2015Updated 10 years ago
- Pintograph simulator in Javascript☆12Oct 23, 2025Updated 4 months ago