Spark fires is an anti-pattern playground where we deliberately break Spark applications in various ways so you can observe what happens and, ideally, recognise the issue when you come across it in your day-to-day development and support work.
☆42 · Nov 18, 2024 · Updated last year
Alternatives and similar repositories for spark-fires
Users who are interested in spark-fires are comparing it to the libraries listed below.
- Release notes for Apache Spark based Runtime for Azure Synapse Analytics and Microsoft Fabric ☆37 · Jun 22, 2026 · Updated last week
- Tools for Microsoft Fabric ☆25 · Jul 17, 2025 · Updated 11 months ago
- How to run DBT on AWS Fargate ☆13 · Oct 15, 2019 · Updated 6 years ago
- Genie Framework improves Spark Pool utilization by executing multiple Synapse notebooks on the same spark pool instance ☆28 · Dec 19, 2023 · Updated 2 years ago
- A data pipeline with Kafka, Spark Streaming, dbt, Docker, Airflow, and GCP! ☆12 · Jul 6, 2023 · Updated 2 years ago
- ☆15 · Aug 28, 2025 · Updated 10 months ago
- ☆13 · May 12, 2026 · Updated last month
- Delta Lake helper methods in PySpark ☆329 · Jan 19, 2026 · Updated 5 months ago
- ☆17 · Nov 27, 2025 · Updated 7 months ago
- Collect and aggregate on spark events for profitz ☆10 · Apr 22, 2022 · Updated 4 years ago
- Type-annotate your spark dataframes and validate them ☆14 · Feb 5, 2026 · Updated 4 months ago
- command launcher organised in a tree structure with autocompletion ☆13 · May 4, 2022 · Updated 4 years ago
- Data Engineering framework written in Python based in Polars. ☆14 · May 1, 2024 · Updated 2 years ago
- An example implementation of DevOps for Power BI in GitHub ☆16 · Jul 27, 2023 · Updated 2 years ago
- Just simple JavaScript framework. Provides support for manipulating with DOM and events handling. Easy for use, optimized for performance… ☆11 · Feb 15, 2017 · Updated 9 years ago
- Instructions and code for the workshop "From Big Data to NLP Insights: Unlocking the Power of PySpark and Spark NLP" ☆12 · May 9, 2023 · Updated 3 years ago
- Delta Lake examples ☆238 · Oct 8, 2024 · Updated last year
- A project to design a fact and dimension star schema for optimizing queries on a flight booking database using PostgreSQL, a relational d… ☆12 · Aug 15, 2021 · Updated 4 years ago
- Adds a Doctrine Id generator which uses an ordered UUID in MySQL for extra performance. Uses methods described in Karhik Appigatla's arti… ☆10 · Jun 8, 2015 · Updated 11 years ago
- An LLM-powered chatbot with the added context of the dbt knowledge base. ☆39 · Dec 4, 2024 · Updated last year
- A Tool which helps automating bigquery backup and restore operations ☆18 · Jun 24, 2025 · Updated last year
- 🐆A lightweight, high-performance string manipulation library optimized for speed-sensitive applications. ☆16 · Updated this week
- A from scratch Python implementation of Apache Kafka concepts including producers, brokers, topics, consumers, and offset management, bui… ☆23 · Jul 29, 2025 · Updated 11 months ago
- My personal dotfiles with automated macOS setup. Features smart installation scripts, Bats testing (bash), performance monitoring, and 2… ☆12 · Jun 22, 2026 · Updated last week
- A low-overhead sampling profiler for PySpark, that outputs Flame Graphs ☆16 · Dec 17, 2020 · Updated 5 years ago
- ACID and BASE transactions explained ☆15 · May 18, 2025 · Updated last year
- A Rust port of the WebGraph framework ☆61 · Jun 10, 2026 · Updated 2 weeks ago
- A lightweight React hook that automatically manages fade overlays for scrollable containers. Provides smooth gradient transitions at the … ☆12 · Aug 11, 2025 · Updated 10 months ago
- Power BI External Tool to run automated checks in a report ☆21 · May 23, 2023 · Updated 3 years ago
- Data Lineage for Spark components and PowerBI/AAS showing up in Azure Purview ☆20 · Jun 11, 2024 · Updated 2 years ago
- High performance async Mssql library for Python. ☆23 · May 29, 2026 · Updated last month
- Optimizing loading training data from cloud bucket storage for cloud-based distributed deep learning. Official repository for Quantifying… ☆11 · Jan 1, 2022 · Updated 4 years ago
- ☆26 · Feb 14, 2025 · Updated last year
- Stable Magisk modules for performance and efficient battery usage on rooted Android devices. ☆32 · Jun 18, 2026 · Updated last week
- A lightweight, declarative PySpark framework for data quality validation — check columns, rows, and entire datasets directly in your Spar… ☆76 · Jun 8, 2026 · Updated 3 weeks ago
- Sekai Viewer but built with Next, optimized for performance ☆11 · Jan 20, 2023 · Updated 3 years ago
- Turning PySpark Into a Universal DataFrame API ☆522 · Jun 18, 2026 · Updated last week
- A survey app written in Flask ☆13 · Apr 16, 2018 · Updated 8 years ago
- Portable Neovim configuration built with Nix. ☆18 · May 1, 2026 · Updated last month