owenrh/spark-fires

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/owenrh/spark-fires)

owenrh / spark-fires

Spark fires is a anti-pattern playground where we deliberately break Spark applications in various ways so you can observe what happens and potentially recognise the issue when you come across it in your day-to-day development and support activities.

☆42

Alternatives and similar repositories for spark-fires

Users that are interested in spark-fires are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

gbrueckl / Fabric.Toolbox
View on GitHub
Tools for Microsoft Fabric
☆26Jun 26, 2026Updated 3 weeks ago
nitish9413 / open_auto_loader
View on GitHub
OpenAutoLoader: A lightweight, open-source alternative to Databricks Auto Loader. Built with Polars and SQLite for efficient, incremental…
☆15Apr 8, 2026Updated 3 months ago
microsoft / synapse-spark-runtime
View on GitHub
Release notes for Apache Spark based Runtime for Azure Synapse Analytics and Microsoft Fabric
☆40Updated this week
petehanssens / dbt-fargate
View on GitHub
How to run DBT on AWS Fargate
☆13Oct 15, 2019Updated 6 years ago
nburkett / ShowPulse_Ticketmaster
View on GitHub
A data pipeline with Kafka, Spark Streaming, dbt, Docker, Airflow, and GCP!
☆12Jul 6, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
eapowertools-archive / qs-data-connection-analyzer
View on GitHub
Unsupported - The Qlik Sense Data Connection Analyzer is a Qlik application that parses script log files and queries the QRS API, allowin…
☆11Jan 3, 2023Updated 3 years ago
MrPowers / mack
View on GitHub
Delta Lake helper methods in PySpark
☆328Jan 19, 2026Updated 6 months ago
Pulsweb / MyScripts
View on GitHub
☆13May 12, 2026Updated 2 months ago
ssntpl / cloud-storage
View on GitHub
A powerful Laravel storage driver that enables seamless synchronization of files across multiple disks, with an integrated cache disk for…
☆15Nov 11, 2025Updated 8 months ago
Nick-Harvey / tensorshift
View on GitHub
☆10Nov 9, 2017Updated 8 years ago
ion-elgreco / polars-deltalake
View on GitHub
Native Polars I/O plugin for Delta Lake, backed by delta-kernel-rs.
☆19Jun 8, 2026Updated last month
dashbook / dashtool
View on GitHub
☆17Nov 27, 2025Updated 7 months ago
sashgorokhov / pyspark-spy
View on GitHub
Collect and aggregate on spark events for profitz
☆10Apr 22, 2022Updated 4 years ago
starterTree / starterTree
View on GitHub
command launcher organised in a tree structure with autocompletion
☆13May 4, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
giancarllotorres / IaC-Fabric
View on GitHub
☆15Aug 28, 2025Updated 10 months ago
Raiffeisen-DGTL / checkita-data-quality
View on GitHub
Fast data quality framework for modern data infrastructure
☆29Apr 2, 2026Updated 3 months ago
Arjunsharmahehe / bloom-filter
View on GitHub
A simple bloom filter implementaiton using python
☆12Mar 1, 2025Updated last year
daniellecrobinson / Data-Rescue-PDX
View on GitHub
Volunteer guide, and other materials for DATA RESCUE PDX
☆29Mar 4, 2017Updated 9 years ago
ecoen66 / homebridge-solaredge-inverter
View on GitHub
SolarEdge Inverter plugin for homebridge
☆15May 6, 2026Updated 2 months ago
janelznic / simplyjs
View on GitHub
Just simple JavaScript framework. Provides support for manipulating with DOM and events handling. Easy for use, optimized for performance…
☆11Feb 15, 2017Updated 9 years ago
hebench / reference-seal-backend
View on GitHub
The SEAL-CPU backend is a Reference backend engine for HEBench which is a shared library that implements the required functions specified…
☆11Mar 3, 2023Updated 3 years ago
bigdevwhale / cachetastic
View on GitHub
💾 Optimize Laravel caching with Cachetastic! Cache method results, force refresh, handle errors, and boost app performance effortlessly.
☆13Jan 26, 2026Updated 5 months ago
delta-io / delta-examples
View on GitHub
Delta Lake examples
☆241Oct 8, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
claudiocmm / data_engineering_projects
View on GitHub
A collection of Data Engineering projects using different cloud providers. Explore real-world implementations of data pipelines, transfor…
☆17Apr 7, 2025Updated last year
ffirg / openshift
View on GitHub
Scripts for stepping through OSE demo examples
☆13Apr 28, 2021Updated 5 years ago
rayriffy / sekai-next
View on GitHub
Sekai Viewer but built with Next, optimized for performance
☆11Jan 20, 2023Updated 3 years ago
simeg / dotfiles
View on GitHub
My personal dotfiles with automated macOS setup. Features smart installation scripts, Bats testing (bash), performance monitoring, and 2…
☆11Jul 13, 2026Updated last week
dbt-labs / dbot
View on GitHub
An LLM-powered chatbot with the added context of the dbt knowledge base.
☆39Dec 4, 2024Updated last year
analyticalmonk / pyspark_nlp_workshop
View on GitHub
Instructions and code for the workshop "From Big Data to NLP Insights: Unlocking the Power of PySpark and Spark NLP"
☆12May 9, 2023Updated 3 years ago
saboye / Data-Modeling-with-Postgres
View on GitHub
A project to design a fact and dimension star schema for optimizing queries on a flight booking database using PostgreSQL, a relational d…
☆12Aug 15, 2021Updated 4 years ago
microsoft / DataLineage
View on GitHub
Data Lineage for Spark components and PowerBI/AAS showing up in Azure Purview
☆20Jun 11, 2024Updated 2 years ago
cosmicThreePointO / use-scroll-fades
View on GitHub
A lightweight React hook that automatically manages fade overlays for scrollable containers. Provides smooth gradient transitions at the …
☆12Aug 11, 2025Updated 11 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
WrenchRB / w-backitemdisplay
View on GitHub
Backpack Attachments is a FiveM resource for attaching weapons and items to players' backs. It supports customizable attachment points, h…
☆10Nov 14, 2024Updated last year
iggisv9t / bootcamp4
View on GitHub
My best solution for mlbootcamp4 competition
☆11Jun 11, 2017Updated 9 years ago
mattp97 / Trotter-Qdrift-Simulation
View on GitHub
An experimental python library to compile and analyze the cost of any desired composite simulation in real or imaginary time, and with or…
☆11Feb 9, 2024Updated 2 years ago
FilipBartos / express-handlebars-criticalcss
View on GitHub
Demo repository for article "Express server, Handlebars & Critical Path Performance Optimization"
☆13Jan 12, 2017Updated 9 years ago
jamespic / pyspark-flame
View on GitHub
A low-overhead sampling profiler for PySpark, that outputs Flame Graphs
☆16Dec 17, 2020Updated 5 years ago
TranTaiDakLak / SunnyUI.AOT.WinForms
View on GitHub
Reworked SunnyUI for Native AOT. Fixed stability issues when running WinForms in AOT mode with .NET 8/9. Optimized for compatibility and …
☆16Aug 7, 2025Updated 11 months ago
LiterallyEthical / r3conwhal3
View on GitHub
r3conwhale aims to develop a multifunctional recon chain for web applications, intelligently interpreting collected data, and optimizing …
☆14Jul 3, 2024Updated 2 years ago