swoop-inc / spark-recordsView external linksLinks
Bulletproof Apache Spark jobs with fast root cause analysis of failures.
☆73Mar 14, 2021Updated 4 years ago
Alternatives and similar repositories for spark-records
Users that are interested in spark-records are comparing it to the libraries listed below
Sorting:
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive☆186Oct 15, 2025Updated 4 months ago
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark☆29Nov 4, 2024Updated last year
- Utilities for writing tests that use Apache Spark.☆24Dec 29, 2018Updated 7 years ago
- A tool to validate data, built around Apache Spark.☆100Feb 9, 2026Updated last week
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)☆454Feb 8, 2026Updated last week
- Essential Spark extensions and helper methods ✨😲☆766Sep 14, 2025Updated 5 months ago
- ## Auto-archived due to inactivity. ## Simple JVM Profiler Using StatsD and Other Metrics Backends☆15Oct 3, 2023Updated 2 years ago
- A Spark connector for the Azure Common Data Model☆15May 31, 2023Updated 2 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆61Sep 4, 2023Updated 2 years ago
- Run spark calculations from Ammonite☆117Feb 3, 2026Updated last week
- Spark data profiling utilities☆22Nov 24, 2018Updated 7 years ago
- Apache Amaterasu☆56Oct 18, 2019Updated 6 years ago
- A series of articles that explore working with data using Datafusion and Apache Arrow.☆10Mar 17, 2021Updated 4 years ago
- Source code for http://allaboutscala.com/scala-cheatsheet/☆10Jun 12, 2018Updated 7 years ago
- SQL for Redis☆11Sep 16, 2022Updated 3 years ago
- Movie Recommendation System Using Spark ML, Akka and Cassandra☆12Oct 4, 2019Updated 6 years ago
- ☆45Apr 27, 2020Updated 5 years ago
- Library for organizing batch processing pipelines in Apache Spark☆42Jan 4, 2017Updated 9 years ago
- Deriving Spark DataFrame schemas from case classes☆44Jun 24, 2024Updated last year
- Sample processing code using Spark 2.1+ and Scala☆51Jun 28, 2020Updated 5 years ago
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.☆432Jan 14, 2022Updated 4 years ago
- Create command line applications with a config file.☆16Mar 7, 2017Updated 8 years ago
- Traditionally, engineers were needed to implement business logic via data pipelines before business users can start using it. Using this …☆12Feb 5, 2026Updated last week
- Azure Synapse Analytics Samples☆14Feb 15, 2023Updated 3 years ago
- Atomic Scala Book Solutions - for Beginners and first time Functional Programmers☆12Mar 10, 2020Updated 5 years ago
- A bunch of low-level basic methods for data processing and monitoring with Scala Spark☆10Jun 29, 2018Updated 7 years ago
- Sketching data structures for scala, including t-digest☆15Sep 7, 2021Updated 4 years ago
- Data quality control tool built on spark and deequ☆25Jan 22, 2026Updated 3 weeks ago
- Cloud based Data Platform based on Apache Spark☆27Oct 7, 2025Updated 4 months ago
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆811Feb 5, 2026Updated last week
- A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster.☆47Aug 1, 2016Updated 9 years ago
- This project enables you to use spring inside of a spark application.☆11May 6, 2015Updated 10 years ago
- All Certification and preparation, examples & others☆11Oct 18, 2018Updated 7 years ago
- Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and unde…☆16Oct 1, 2022Updated 3 years ago
- Generates 27-character, time-ordered, k-sortable, URL-safe, globally unique identifiers.☆26May 19, 2019Updated 6 years ago
- explore kafka, spark, fs2 and pure functional programming in scala☆33Updated this week
- Delta Lake Examples☆11Apr 24, 2020Updated 5 years ago
- A tutorial about how to start with Cosmos DB - The information I would have loved to have before setting out with Cosmos DB.☆16Dec 8, 2022Updated 3 years ago
- ☆17Apr 8, 2023Updated 2 years ago