A library for Spark DataFrame using MinIO Select API
☆102Sep 27, 2019Updated 6 years ago
Alternatives and similar repositories for spark-select
Users that are interested in spark-select are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Collection of docker images, helm charts and other tools needed to build DataLake on Kubernetes.☆13Oct 19, 2018Updated 7 years ago
- Depreciated in favor of datalake-kubernetes. Collection of Kubernetes Big Data ecosystem products helm charts☆11Aug 9, 2018Updated 7 years ago
- MinIO Client SDK for Haskell☆52May 13, 2025Updated 11 months ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆62Sep 4, 2023Updated 2 years ago
- Filling in the Spark function gaps across APIs☆50Apr 14, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- High-performance log search engine.☆359Jul 17, 2020Updated 5 years ago
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.☆10May 12, 2023Updated 2 years ago
- Apache Atlas v2 Rest API☆23Apr 27, 2018Updated 8 years ago
- A blazing-fast PostgreSQL client built in Rust. No Electron. No JVM. No bloat.☆80Updated this week
- Homebrew tap for MinIO☆20Sep 7, 2025Updated 7 months ago
- Adaptive File Source Connector for Spark, optimised for reading from object stores☆15Oct 18, 2022Updated 3 years ago
- FluxCD and Express.js GitOps tutorial for Civo☆80Jan 29, 2020Updated 6 years ago
- Collection of tests to detect overall correctness of MinIO server.☆102Jan 8, 2026Updated 3 months ago
- Minio Object Storage Server☆10Nov 19, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive☆191Oct 15, 2025Updated 6 months ago
- A custom ContentRepository implementation for NiFi to persist data to MinIO Object Storage☆35Jul 15, 2022Updated 3 years ago
- Spark metrics related custom classes and sinks (e.g. Prometheus)☆188Aug 2, 2022Updated 3 years ago
- Several swing components and utilities☆12Aug 11, 2024Updated last year
- Drive performance measurement tool☆77Dec 29, 2025Updated 4 months ago
- Run TPCH Benchmark on Apache Kylin☆22Jan 24, 2022Updated 4 years ago
- Go Client for Hive Metastore☆14Dec 18, 2022Updated 3 years ago
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shou…☆10Jul 31, 2023Updated 2 years ago
- ☆41May 16, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Colab, MLflow and papermill are individually great. Together they form a dream team.☆10Jun 9, 2020Updated 5 years ago
- various scripts☆20Dec 16, 2022Updated 3 years ago
- Essential Spark extensions and helper methods ✨😲☆766Sep 14, 2025Updated 7 months ago
- Akka plugin to collect various data about actors☆17Aug 19, 2024Updated last year
- Apache Storm 0.9.3-rc1 Docker cluster deployed on Apache Mesos with Marathon.☆11Jan 5, 2015Updated 11 years ago
- An open protocol for secure data sharing☆938Updated this week
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆95May 9, 2025Updated 11 months ago
- Single node, in-memory DataFrame analytics library.☆44Mar 6, 2026Updated last month
- ☆13Dec 12, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Scala client for MaxMind Geo-IP☆87Feb 18, 2026Updated 2 months ago
- ☆23May 2, 2024Updated 2 years ago
- Generating Federated GraphQL API's from Datasources with Apache Calcite☆37Feb 21, 2022Updated 4 years ago
- A DSL for scalacOptions☆17Updated this week
- Plug-and-play implementation of an Apache Spark custom data source for AWS DynamoDB.☆173Mar 6, 2021Updated 5 years ago
- Type safety for spark columns☆79Oct 27, 2025Updated 6 months ago
- Stencila for Python☆17Aug 3, 2018Updated 7 years ago