Testing Sandbox for Hadoop Ecosystem Components
☆44Apr 29, 2026Updated last week
Alternatives and similar repositories for hadoop-testing
Users that are interested in hadoop-testing are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆16Jan 4, 2026Updated 4 months ago
- Apache Kyuubi Site☆13Mar 26, 2026Updated last month
- A Full RPC Framework Based on Netty.☆14May 19, 2018Updated 7 years ago
- ☆15Oct 12, 2021Updated 4 years ago
- Spark integrations for working with Lance datasets☆47Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Alerting and monitoring tool for Apache Spark☆23May 20, 2022Updated 3 years ago
- Uniffle is a high performance, general purpose Remote Shuffle Service.☆450Apr 7, 2026Updated last month
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…☆257Apr 7, 2023Updated 3 years ago
- Client libraries of end users of Apache Kyuubi☆11Jan 10, 2023Updated 3 years ago
- SparkSQL自定义Hint优化器解决热点数据导致JOIN数据倾斜问题☆48Jan 4, 2019Updated 7 years ago
- ☆10Dec 5, 2022Updated 3 years ago
- NetEase Spark Courses☆15Sep 4, 2018Updated 7 years ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆2,327Apr 28, 2026Updated last week
- SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.☆136Mar 6, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A library that brings useful functions from various modern database management systems to Apache Spark☆62Sep 4, 2023Updated 2 years ago
- ☆18May 25, 2024Updated last year
- Redash plugin for Apache Kylin integration☆12Mar 21, 2018Updated 8 years ago
- Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.☆1,045Apr 23, 2026Updated 2 weeks ago
- This is a library for SQL optimizing/rewriting including Materialized View rewrite☆69Jun 21, 2022Updated 3 years ago
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.☆37Jan 3, 2023Updated 3 years ago
- 又一个newapi的二开☆19May 1, 2026Updated last week
- Port of TPC-DS dsdgen to Java☆22Nov 29, 2022Updated 3 years ago
- A playground to experience Gravitino☆77Mar 16, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆17Mar 19, 2024Updated 2 years ago
- An alternative to the "hive standalone" jar for connecting Java applications to Apache Hive via JDBC☆41Oct 1, 2024Updated last year
- 已经合入(apache/incubator-kyuubi) ACL Management for Apache Spark SQL with Apache Ranger.☆58Nov 11, 2021Updated 4 years ago
- HDFS Native Client in Rust via HDFS C API libhdfs☆41Jan 27, 2025Updated last year
- Generates Art for your contribution graph☆14Jun 13, 2020Updated 5 years ago
- Alluxio Python client - Access Any Data Source with Python☆31Sep 29, 2025Updated 7 months ago
- A Model Context Protocol (MCP) server for Apache Dolphinscheduler. This provides access to your Apache Dolphinshcheduler RESTful API V1 …☆21May 14, 2025Updated 11 months ago
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆898Apr 27, 2026Updated last week
- ☆14Jan 4, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.☆262May 12, 2024Updated last year
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.☆286Feb 24, 2026Updated 2 months ago
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,558Updated this week
- Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.☆1,127Apr 23, 2026Updated 2 weeks ago
- A Fully HiveServer2-like Multi-tenancy Spark Thrift Server Supporting Impersonation and Multi-SparkContext with Ranger Authorization (GO …☆10Jul 7, 2022Updated 3 years ago
- A tool to get better debug info on spark's memory usage☆42Aug 21, 2019Updated 6 years ago
- 📄 📃 papers that I read and noted 🧐☆37Apr 23, 2026Updated 2 weeks ago