Spark data profiling utilities
☆23Nov 24, 2018Updated 7 years ago
Alternatives and similar repositories for spark-meta
Users that are interested in spark-meta are comparing it to the libraries listed below
Sorting:
- Synthesis with Metaheuristics - Genetic Programming in Scala☆15Oct 4, 2019Updated 6 years ago
- AIS visualization from an interactive R and Shiny based web app using Material Design from Google.☆13Sep 13, 2018Updated 7 years ago
- Fast bottom up trend reversal detection algorithm.☆14Oct 1, 2020Updated 5 years ago
- Converts 3D file formats to Minecraft schematics☆14Mar 8, 2013Updated 13 years ago
- low-level helpers for Apache Spark libraries and tests☆16Dec 29, 2018Updated 7 years ago
- ☆19Sep 8, 2017Updated 8 years ago
- type-class based data cleansing library for Apache Spark SQL☆78Jun 23, 2019Updated 6 years ago
- Test suite to document the behavior of Spark☆21Apr 15, 2021Updated 4 years ago
- Lua/Terra + Java Native Interface☆21Mar 3, 2017Updated 9 years ago
- Welcome to the WSO2 Machine Learner source code! For info on working with the WSO2 Machine Learner repository and contributing code, clic…☆23Jun 1, 2017Updated 8 years ago
- Space-Filling Curves in Scala☆26Aug 25, 2020Updated 5 years ago
- Ready-to-go Docker image with Polynote☆25Aug 6, 2020Updated 5 years ago
- Spark functions to run popular phonetic and string matching algorithms☆59Feb 22, 2022Updated 4 years ago
- Binding the GDELT universe in a Spark environment☆26Apr 21, 2023Updated 2 years ago
- A library for parsing and querying an Esri File Geodatabase with Apache Spark.☆27Nov 13, 2016Updated 9 years ago
- Bucketing and partitioning system for Parquet☆30May 22, 2018Updated 7 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆73Mar 14, 2021Updated 4 years ago
- Spark job to perform massive Point in Polygon (PiP) operations☆32Mar 19, 2017Updated 8 years ago
- Distributed Linear Programming Solver on top of Apache Spark☆80Jan 4, 2021Updated 5 years ago
- ☆36Aug 24, 2022Updated 3 years ago
- A TensorFlow 2.0 .whl file compiled with an old processor/computer☆11Dec 12, 2020Updated 5 years ago
- MongoDB 3.6 Developer Workshop☆10Apr 27, 2018Updated 7 years ago
- Package provides java implementation of the latent dirichlet allocation (LDA) for topic modelling☆10May 18, 2017Updated 8 years ago
- phData Pulse application log aggregation and monitoring☆13Apr 13, 2020Updated 5 years ago
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shou…☆10Jul 31, 2023Updated 2 years ago
- ☆13Aug 11, 2025Updated 6 months ago
- Note: the repo has been moved to https://gitlab.com/readcoop/Transkribus/TranskribusCore☆38Oct 28, 2020Updated 5 years ago
- Spark Custome Stream Source and Sink☆12Jan 19, 2019Updated 7 years ago
- This repo is a curated list of places I consider for weekends in Athens with my kid.☆11Dec 19, 2021Updated 4 years ago
- HDFS rsync-like utility to replicate data between HDFS clusters☆17Jun 16, 2012Updated 13 years ago
- Hive Storage Handler for SOLR☆16Mar 17, 2014Updated 11 years ago
- Extension for profiling performance of JupyterLab UI for JupyterLab core developers, extension developers, and advanced users.☆14Mar 1, 2026Updated last week
- Ready to use UI patterns for websites☆16Sep 17, 2015Updated 10 years ago
- Simple, beautiful data driven tooltip☆13Mar 13, 2022Updated 3 years ago
- Chapel Data Object☆10Jun 9, 2021Updated 4 years ago
- Generate a JSON schema from an example object☆10Jul 24, 2016Updated 9 years ago
- Uses a genetic algorithm to "evolve" brainfuck programs with desirable behaviours☆11Feb 8, 2025Updated last year
- My Very Own Deep Multiple Layered Echo State Network☆13Jan 2, 2021Updated 5 years ago
- A collection of demonstration languages in Lua/Terra suitable for learning or for forking when creating a new language☆11Aug 27, 2015Updated 10 years ago