Ackuq / spark-pitView external linksLinks
Point-in-Time optimizations for Apache Spark
☆30Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for spark-pit
Users that are interested in spark-pit are comparing it to the libraries listed below
Sorting:
- Python - Java/Scala API for the Hopsworks feature store☆55Sep 24, 2025Updated 4 months ago
- Ultra-high-performance local IPC framework with Zipkin tracing to conduct a beautiful symphony of (brotherhood) build tooling.☆10Jan 8, 2021Updated 5 years ago
- A Generic Resource-Aware Hyperparameter Tuning Execution Engine☆15Jan 8, 2022Updated 4 years ago
- something to help you spark☆64Oct 23, 2018Updated 7 years ago
- Angular Material Meteor Dashboard template☆14Oct 14, 2019Updated 6 years ago
- Distributed solver library for large-scale structured output prediction, based on Spark. Project website:☆17Mar 3, 2016Updated 9 years ago
- CUDA kernel and JNI code which is called by Apache Spark's MLlib.☆19Jun 18, 2016Updated 9 years ago
- SQLAlchemy dialect for Databricks☆20May 15, 2023Updated 2 years ago
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver.☆25Sep 8, 2023Updated 2 years ago
- A python library bakeoff for medium sized datasets☆24Aug 25, 2023Updated 2 years ago
- Kompics - A message-passing component model for building distributed systems☆66Oct 4, 2022Updated 3 years ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆62Sep 6, 2024Updated last year
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark☆29Nov 4, 2024Updated last year
- Scala framework for collecting performance metrics and conducting sound experimental benchmarking.☆13Nov 19, 2025Updated 2 months ago
- Core Gwen interpreter☆36Jan 7, 2026Updated last month
- EncryCore node reference implementation☆15Apr 2, 2020Updated 5 years ago
- a curated list of awesome lakehouse frameworks, applications, etc☆40Updated this week
- A package that enables the use of SIMD x86 instructions in the Lightweight Modular Staging Framework (LMS).☆40Apr 19, 2018Updated 7 years ago
- Data-Driven Spark allows quick data exploration based on Apache Spark.☆29Jan 6, 2017Updated 9 years ago
- Reproducing Distributed Systems and Experiments on Cloud☆39Sep 11, 2023Updated 2 years ago
- FeatHub - A stream-batch unified feature store for real-time machine learning☆347May 27, 2024Updated last year
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆37Mar 9, 2021Updated 4 years ago
- Benchmarks of artificial neural network library for Spark MLlib☆11Dec 3, 2015Updated 10 years ago
- A command-line interface for interacting with the NeoLoad Web Platform...running tests, reporting results, etc...on your workstation or i…☆10Jan 20, 2026Updated 3 weeks ago
- This solution helps you deploy ETL processes and data storage resources to create an Insurance Lake using Amazon S3 buckets for storage, …☆16Feb 5, 2026Updated last week
- Example for baking the current git commit hash into a bazel C++ project☆11Jan 25, 2022Updated 4 years ago
- Proximal Asynchronous SAGA☆13Nov 30, 2017Updated 8 years ago
- ☆11Nov 26, 2024Updated last year
- Factorization Machines for Julia☆11Aug 26, 2016Updated 9 years ago
- FTRL-Proximal Online Learning Algorithm☆15May 22, 2017Updated 8 years ago
- Sangria akka-streams integration☆11Feb 8, 2026Updated last week
- An example of a multiple workspace deployment with reusable modules.☆13May 28, 2025Updated 8 months ago
- Provides time series data and metadata as Apache Arrow.☆16Updated this week
- Taiga for Dataset management☆12Feb 5, 2026Updated last week
- An Apache Mesos Framework that allows for replaying load over and over and over (and over) again☆10Aug 10, 2015Updated 10 years ago
- Gain information about applications to inform deployments☆11Mar 3, 2022Updated 3 years ago
- Hierarchical Image Representation☆10Dec 9, 2023Updated 2 years ago
- Python SDK to interact with the Hopsworks API☆14Updated this week
- A repository for all code generated at our Datadive events☆36May 12, 2012Updated 13 years ago