holdenk/high-performance-spark-examples

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/holdenk/high-performance-spark-examples)

holdenk / high-performance-spark-examples

Examples for High Performance Spark

☆16

Alternatives and similar repositories for high-performance-spark-examples

Users that are interested in high-performance-spark-examples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mdymczyk / iot-pipeline
View on GitHub
☆15Sep 27, 2017Updated 8 years ago
jaceklaskowski / spark-delta-lake-workshop
View on GitHub
Spark and Delta Lake Workshop
☆22Jun 14, 2022Updated 4 years ago
holdenk / spark-validator
View on GitHub
A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support…
☆111Feb 1, 2018Updated 8 years ago
scravy / pysparkextra
View on GitHub
☆10Jun 29, 2021Updated 5 years ago
yohanliyanage / jenkins-spark-deploy
View on GitHub
A Jenkins plugin that allows to deploy / stop Apache Spark applications in Spark standalone clusters.
☆10Oct 25, 2015Updated 10 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
foursquare / datasource-plugin-clouderamanager
View on GitHub
Cloudera Manager datasource for Grafana 3.x
☆19Jun 28, 2023Updated 3 years ago
holdenk / spark-structured-streaming-ml
View on GitHub
Structured Streaming Machine Learning example with Spark 2.0
☆95Apr 24, 2017Updated 9 years ago
spotify / gordon-gcp
View on GitHub
GCP Plugin for Gordon: Event-driven Cloud DNS
☆12Apr 5, 2023Updated 3 years ago
ApolloCrawler / microcrawler-js
View on GitHub
Scrapping made easy...
☆15Sep 3, 2016Updated 9 years ago
eddelbuettel / rapiserialize
View on GitHub
Serialization from the C API for R
☆14Jan 6, 2026Updated 6 months ago
m-a-j / combine
View on GitHub
rvest test grounds
☆10Jan 6, 2016Updated 10 years ago
cguegi / azure-databricks-airflow-example
View on GitHub
Example of orchestrating dependent Databricks jobs using Airflow
☆11Dec 19, 2019Updated 6 years ago
alexander-n-thomas / nlp.spark.annotate
View on GitHub
notebooks for nlp-on-spark
☆13Jan 27, 2017Updated 9 years ago
part-os / core-python
View on GitHub
☆15Mar 30, 2026Updated 3 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lc0 / docker-shiny-server
View on GitHub
Docker container for Shiny Server
☆14Mar 31, 2016Updated 10 years ago
nielsutrecht / kafka-serializer-example
View on GitHub
Example of how to create your own custom serializers for Kafka queues including JSON, Smile and Kryo
☆25Mar 10, 2016Updated 10 years ago
networkop / cue-ansible
View on GitHub
CUE vs Ansible
☆18Aug 12, 2022Updated 3 years ago
scrapedia / scrapy-pipelines
View on GitHub
A collection of pipelines for Scrapy
☆16Apr 27, 2026Updated 2 months ago
trustedanalytics / spark-tk
View on GitHub
☆32Mar 20, 2020Updated 6 years ago
davecaos / kylie
View on GitHub
Kylie is a blond and small Elixir client for Cayley graph data base
☆12Apr 17, 2026Updated 3 months ago
erikerlandson / spark-kafka-sink
View on GitHub
A Kafka metric sink for Apache Spark
☆11Apr 13, 2017Updated 9 years ago
simonw / llm-templates-github
View on GitHub
Research prototype for new register_template_loaders LLM plugin hook
☆19Apr 7, 2025Updated last year
pdm-project / pdm-autoexport
View on GitHub
A PDM plugin to sync the exported files with the project file
☆15Sep 6, 2025Updated 10 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
cevoaustralia / glue-vscode
View on GitHub
Local Development of AWS Glue with Docker and Visual Studio Code
☆14Nov 29, 2021Updated 4 years ago
satyenrajpal / RL_algos
View on GitHub
Reinforcement Learning Algorithms
☆14May 28, 2018Updated 8 years ago
google-github-actions / github-workflow-job-to-pubsub
View on GitHub
Fulfills a GitHub workflow_job webhooks into a Pub/Sub queue.
☆12Mar 13, 2025Updated last year
mrm1001 / spark_tutorial
View on GitHub
Code for the Spark tutorial at the Pydata conference in London June 2015
☆12Oct 9, 2016Updated 9 years ago
taizilongxu / nyancat
View on GitHub
Nyancat in your terminal!
☆14May 29, 2018Updated 8 years ago
MichaelShoemaker / michaelshoemaker.github.io
View on GitHub
Portfolio Site
☆19Dec 28, 2025Updated 6 months ago
augerai / a2ml
View on GitHub
Common API for all "second gen" AutoML APIs: Auger.AI, Google Cloud AutoML and Azure AutoML
☆38Dec 21, 2024Updated last year
yorek / zeppelin
View on GitHub
Apache Zeppelin with support for SQL Server
☆16Sep 25, 2017Updated 8 years ago
selfuryon / awesome-cue-infra
View on GitHub
The example of using cue with Kubernetes, ArgoCD and Crossplane
☆18Sep 9, 2024Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
bergant / xbrlus
View on GitHub
R interface to XBRL US API
☆22Feb 22, 2018Updated 8 years ago
Shinichi-Nakagawa / airflow-docker
View on GitHub
Apache Airflow Docker Image.
☆16May 3, 2018Updated 8 years ago
Azure / DAICE_DatabricksSparkDevOps
View on GitHub
A set of example build and release pipelines for deploying Python and Scala to Azure Databricks and HDInsight
☆14Jun 4, 2020Updated 6 years ago
aws-samples / redshift-streaming-ingestion-patterns
View on GitHub
This is a collecton of CDK projects to show how to load data from streaming services into Amazon Redshift.
☆13Sep 10, 2024Updated last year
MrDataPsycho / data-pipelines-in-rust
View on GitHub
Data pipeline example written in Rust with Polars and DataFusion DataFrame package
☆40Mar 12, 2023Updated 3 years ago
ramhiser / noncensus
View on GitHub
U.S. Census Region and Demographic Data
☆26Jan 20, 2016Updated 10 years ago
MicroBioScopicData / Cryptos_Analysis
View on GitHub
Cryptocurrency Analysis with Python
☆16Sep 25, 2023Updated 2 years ago