jasonsatran / spark-meta
Spark data profiling utilities
☆22 · Nov 24, 2018 · Updated 7 years ago
Alternatives and similar repositories for spark-meta
Users who are interested in spark-meta are comparing it to the libraries listed below.
- low-level helpers for Apache Spark libraries and tests ☆16 · Dec 29, 2018 · Updated 7 years ago
- type-class based data cleansing library for Apache Spark SQL ☆78 · Jun 23, 2019 · Updated 6 years ago
- Test suite to document the behavior of Spark ☆21 · Apr 15, 2021 · Updated 4 years ago
- Spark functions to run popular phonetic and string matching algorithms ☆59 · Feb 22, 2022 · Updated 3 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures. ☆73 · Mar 14, 2021 · Updated 4 years ago
- ☆36 · Aug 24, 2022 · Updated 3 years ago
- Schema Registry integration for Apache Spark ☆40 · Nov 16, 2022 · Updated 3 years ago
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shou… ☆10 · Jul 31, 2023 · Updated 2 years ago
- Package providing a Java implementation of latent Dirichlet allocation (LDA) for topic modelling ☆10 · May 18, 2017 · Updated 8 years ago
- phData Pulse application log aggregation and monitoring ☆13 · Apr 13, 2020 · Updated 5 years ago
- sbt plugin for scala modules. ☆14 · Feb 9, 2026 · Updated last week
- This repo is a curated list of places I consider for weekends in Athens with my kid. ☆11 · Dec 19, 2021 · Updated 4 years ago
- Extension for profiling performance of JupyterLab UI for JupyterLab core developers, extension developers, and advanced users. ☆14 · Jan 29, 2026 · Updated 2 weeks ago
- Plutus for the masses ☆11 · Jan 20, 2023 · Updated 3 years ago
- Generate a JSON schema from an example object ☆10 · Jul 24, 2016 · Updated 9 years ago
- Interactive playing with Math in Scala ☆10 · Jan 4, 2017 · Updated 9 years ago
- ☆11 · Oct 12, 2019 · Updated 6 years ago
- Hadoop InputFormat for http://druid.io/ ☆10 · Oct 26, 2016 · Updated 9 years ago
- An example of SparkConnect extension. ☆15 · Mar 5, 2024 · Updated last year
- Meet Rustacean GPT, an experimental project transforming OpenAI's GPT into a helpful, autonomous software engineer to support senior deve… ☆14 · May 10, 2023 · Updated 2 years ago
- HiveQL Jupyter Kernel ☆10 · Aug 5, 2022 · Updated 3 years ago
- Using the Parquet file format (with Avro) to process data with Apache Flink ☆14 · Aug 17, 2015 · Updated 10 years ago
- Hive Storage Handler for SOLR ☆16 · Mar 17, 2014 · Updated 11 years ago
- HDFS rsync-like utility to replicate data between HDFS clusters ☆17 · Jun 16, 2012 · Updated 13 years ago
- A Rust implementation of the Gravity-post-quantum signature schemes ☆18 · Nov 20, 2025 · Updated 2 months ago
- Bringing up Docker Compose environments for system, integration and performance testing, with support for ScalaTest and Gatling ☆11 · Jul 29, 2021 · Updated 4 years ago
- Generate shell auto completion files ☆13 · Nov 8, 2025 · Updated 3 months ago
- A collection of Flink applications for working with Pravega streams ☆12 · Dec 20, 2022 · Updated 3 years ago
- Deriving Spark DataFrame schemas from case classes ☆44 · Jun 24, 2024 · Updated last year
- Functional programming with Kafka and Scala ☆100 · Updated this week
- JupyterLab Notebook for Mesosphere DC/OS ☆11 · Aug 6, 2019 · Updated 6 years ago
- JSON editor for React ☆11 · Feb 7, 2026 · Updated last week
- ☆10 · Jul 6, 2020 · Updated 5 years ago
- Product Catalog App Built Using React.js & Tailwind.css ☆11 · Dec 25, 2021 · Updated 4 years ago
- A Twisted client for etcd3 ☆14 · Feb 22, 2019 · Updated 6 years ago
- AIS visualization from an interactive R and Shiny based web app using Material Design from Google. ☆12 · Sep 13, 2018 · Updated 7 years ago
- Generates diff markup for two strings. ☆20 · Mar 29, 2013 · Updated 12 years ago
- hive-phoenix-handler is a Hive plug-in that can access Apache Phoenix tables on HBase using HiveQL. ☆10 · Aug 17, 2017 · Updated 8 years ago
- Shapeless generic instances for Scrooge types ☆14 · Feb 16, 2018 · Updated 8 years ago