jasonsatran / spark-metaView external linksLinks
Spark data profiling utilities
☆22Nov 24, 2018Updated 7 years ago
Alternatives and similar repositories for spark-meta
Users that are interested in spark-meta are comparing it to the libraries listed below
Sorting:
- low-level helpers for Apache Spark libraries and tests☆16Dec 29, 2018Updated 7 years ago
- type-class based data cleansing library for Apache Spark SQL☆78Jun 23, 2019Updated 6 years ago
- Test suite to document the behavior of Spark☆21Apr 15, 2021Updated 4 years ago
- Spark functions to run popular phonetic and string matching algorithms☆59Feb 22, 2022Updated 3 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆73Mar 14, 2021Updated 4 years ago
- Schema Registry integration for Apache Spark☆40Nov 16, 2022Updated 3 years ago
- csv to parquet and vice versa file converter based on Pandas written in Python3☆10Mar 23, 2021Updated 4 years ago
- MongoDB 3.6 Developer Workshop☆10Apr 27, 2018Updated 7 years ago
- Package provides java implementation of the latent dirichlet allocation (LDA) for topic modelling☆10May 18, 2017Updated 8 years ago
- A connector for Apache Spark to access Exasol☆12Oct 31, 2025Updated 3 months ago
- phData Pulse application log aggregation and monitoring☆13Apr 13, 2020Updated 5 years ago
- Spark Operations Research☆10Sep 21, 2016Updated 9 years ago
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shou…☆10Jul 31, 2023Updated 2 years ago
- PLSQL Package converting Markdown to HTML☆13May 5, 2017Updated 8 years ago
- Ready to use UI patterns for websites☆16Sep 17, 2015Updated 10 years ago
- Generate a JSON schema from an example object☆10Jul 24, 2016Updated 9 years ago
- An example setup for integrating the oso policy engine logic within a FastAPI application.☆10Dec 5, 2020Updated 5 years ago
- HiveQL Jupyter Kernel☆10Aug 5, 2022Updated 3 years ago
- Meet Rustacean GPT, an experimental project transforming OpenAi's GPT into a helpful, autonomous software engineer to support senior deve…☆14May 10, 2023Updated 2 years ago
- Testing node.js or io.js app with wallaby.js☆10Oct 13, 2017Updated 8 years ago
- ☆10Apr 13, 2020Updated 5 years ago
- Generate a Full Stack Python Web App - Choose the framework you want Vue, React, Angular - Can be run in a single container or without Do…☆13Aug 22, 2021Updated 4 years ago
- Hadoop InputFormat for http://druid.io/☆10Oct 26, 2016Updated 9 years ago
- An example of SparkConnect extension.☆15Mar 5, 2024Updated last year
- Hands-on hub to learn techniques to optimize and serve AI models to production the most optimal way.☆14Aug 20, 2025Updated 5 months ago
- Plutus for the masses☆11Jan 20, 2023Updated 3 years ago
- Interactive playing with Math in Scala☆10Jan 4, 2017Updated 9 years ago
- Using the Parquet file format (with Avro) to process data with Apache Flink☆14Aug 17, 2015Updated 10 years ago
- A collection of Flink applications for working with Pravega streams☆12Dec 20, 2022Updated 3 years ago
- Generate shell auto completion files☆13Nov 8, 2025Updated 3 months ago
- Bringing up Docker Compose environments for system, integration and performance testing, with support for ScalaTest and Gatling☆11Jul 29, 2021Updated 4 years ago
- sbt plugin for scala modules.☆14Feb 9, 2026Updated last week
- HDFS rsync-like utility to replicate data between HDFS clusters☆17Jun 16, 2012Updated 13 years ago
- Hive Storage Handler for SOLR☆16Mar 17, 2014Updated 11 years ago
- Deriving Spark DataFrame schemas from case classes☆44Jun 24, 2024Updated last year
- Functional programming with Kafka and Scala☆100Updated this week
- Product Catalog App Build Using React.js & Tailwind.css☆11Dec 25, 2021Updated 4 years ago
- Generates diff markup for two strings.☆20Mar 29, 2013Updated 12 years ago
- JupyterLab Notebook for Mesosphere DC/OS☆11Aug 6, 2019Updated 6 years ago