Point-in-Time optimizations for Apache Spark
☆30Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for spark-pit
Users that are interested in spark-pit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python - Java/Scala API for the Hopsworks feature store☆55Sep 24, 2025Updated 6 months ago
- A Generic Resource-Aware Hyperparameter Tuning Execution Engine☆15Jan 8, 2022Updated 4 years ago
- SQLAlchemy dialect for Databricks☆20May 15, 2023Updated 2 years ago
- Asynchronous actions for PySpark☆48Dec 2, 2021Updated 4 years ago
- Bindings for FFmpeg☆11May 15, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- DuckDB extension for MySQL☆15Mar 17, 2024Updated 2 years ago
- Ultra-high-performance local IPC framework with Zipkin tracing to conduct a beautiful symphony of (brotherhood) build tooling.☆10Jan 8, 2021Updated 5 years ago
- A dbt adapter for Decodable☆12Sep 4, 2025Updated 7 months ago
- something to help you spark☆64Oct 23, 2018Updated 7 years ago
- Reproducing Distributed Systems and Experiments on Cloud☆40Sep 11, 2023Updated 2 years ago
- Extending the Neural Graph Algorithm Executor☆13Dec 8, 2022Updated 3 years ago
- Scala SDK for Temporal☆10May 31, 2022Updated 3 years ago
- Airflow DAGs for ingesting Bitcoin blockchain data to Neo4j☆15Dec 8, 2022Updated 3 years ago
- ☆18May 23, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆11Nov 14, 2023Updated 2 years ago
- Documentation for Hopsworks and Hops☆10Jan 30, 2022Updated 4 years ago
- Python SDK to interact with the Hopsworks API☆14Updated this week
- A Singer.io target for DuckDB☆19Feb 11, 2026Updated 2 months ago
- A cloud native data mesh implementation☆12Jan 15, 2021Updated 5 years ago
- Trafiklabs website☆18Apr 7, 2026Updated last week
- ☆12Apr 10, 2020Updated 6 years ago
- Mavuno: A Hadoop-Based Text Mining Toolkit☆47Feb 7, 2012Updated 14 years ago
- Mahout vector encoding for pig☆53Nov 20, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Tutorial & scripts to run a meta-rl model on DeepMind Lab's Harlow task environment.☆15Mar 28, 2019Updated 7 years ago
- Transitmap is an interactive realtime visualisation of all public transport in Sweden.☆11Jun 6, 2025Updated 10 months ago
- CUDA kernel and JNI code which is called by Apache Spark's MLlib.☆19Jun 18, 2016Updated 9 years ago
- ☆21Nov 13, 2025Updated 5 months ago
- BirdiDQ leverages the power of the Python Great Expectations open-source library and combines it with the simplicity of natural language …☆23Jul 17, 2023Updated 2 years ago
- An analysis of adverse drug event data using Hadoop, R, and Gephi☆44Jan 28, 2016Updated 10 years ago
- Inspect Your Servers with DuckDB☆31May 8, 2025Updated 11 months ago
- a curated list of awesome lakehouse frameworks, applications, etc☆43Mar 9, 2026Updated last month
- Distributed solver library for large-scale structured output prediction, based on Spark. Project website:☆17Mar 3, 2016Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A Time Series Library for Apache Spark☆1,024Jul 3, 2020Updated 5 years ago
- ☆108Jul 5, 2023Updated 2 years ago
- ☆17May 7, 2024Updated last year
- Apache Arrow Flight example☆11Nov 9, 2020Updated 5 years ago
- Overlapping Normalized Mutual Information and Omega Index evaluation for the overlapping community structure produced by clustering algor…☆16Nov 25, 2019Updated 6 years ago
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆37Mar 9, 2021Updated 5 years ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆62Sep 6, 2024Updated last year