Testing Sandbox for Hadoop Ecosystem Components
☆44 Apr 15, 2026 Updated this week
Alternatives and similar repositories for hadoop-testing
Users that are interested in hadoop-testing are comparing it to the libraries listed below.
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses. ☆16 Jan 4, 2026 Updated 3 months ago
- Apache Kyuubi Site ☆12 Mar 26, 2026 Updated 3 weeks ago
- A Full RPC Framework Based on Netty. ☆14 May 19, 2018 Updated 7 years ago
- ☆15 Oct 12, 2021 Updated 4 years ago
- Spark integrations for working with Lance datasets ☆47 Updated this week
- Alerting and monitoring tool for Apache Spark ☆23 May 20, 2022 Updated 3 years ago
- Uniffle is a high performance, general purpose Remote Shuffle Service. ☆448 Apr 7, 2026 Updated last week
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu… ☆257 Apr 7, 2023 Updated 3 years ago
- Client libraries of end users of Apache Kyuubi ☆11 Jan 10, 2023 Updated 3 years ago
- A custom Spark SQL hint optimizer that addresses JOIN data skew caused by hotspot data ☆48 Jan 4, 2019 Updated 7 years ago
- ☆10 Dec 5, 2022 Updated 3 years ago
- NetEase Spark Courses ☆15 Sep 4, 2018 Updated 7 years ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses. ☆2,322 Updated this week
- SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark. ☆136 Mar 6, 2023 Updated 3 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark ☆62 Sep 4, 2023 Updated 2 years ago
- ☆18 May 25, 2024 Updated last year
- Redash plugin for Apache Kylin integration ☆12 Mar 21, 2018 Updated 8 years ago
- Apache Celeborn is an elastic and high-performance service for shuffle and spilled data. ☆1,042 Updated this week
- Maven packaging and lifecycle for Trino plugins ☆15 Apr 3, 2026 Updated 2 weeks ago
- This is a library for SQL optimizing/rewriting including Materialized View rewrite ☆69 Jun 21, 2022 Updated 3 years ago
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer. ☆37 Jan 3, 2023 Updated 3 years ago
- Port of TPC-DS dsdgen to Java ☆22 Nov 29, 2022 Updated 3 years ago
- Yet another secondary development (fork) of newapi ☆19 Mar 30, 2026 Updated 2 weeks ago
- A playground to experience Gravitino ☆75 Mar 16, 2026 Updated last month
- Generates Art for your contribution graph ☆13 Jun 13, 2020 Updated 5 years ago
- SeaTunnel is a multimodal, high-performance, distributed, massive data integration tool. ☆25 Feb 27, 2026 Updated last month
- ☆17 Mar 19, 2024 Updated 2 years ago
- World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake. ☆2,923 Updated this week
- An alternative to the "hive standalone" jar for connecting Java applications to Apache Hive via JDBC ☆41 Oct 1, 2024 Updated last year
- HDFS Native Client in Rust via HDFS C API libhdfs ☆41 Jan 27, 2025 Updated last year
- Alluxio Python client - Access Any Data Source with Python ☆32 Sep 29, 2025 Updated 6 months ago
- A Model Context Protocol (MCP) server for Apache DolphinScheduler. This provides access to your Apache DolphinScheduler RESTful API V1 … ☆21 May 14, 2025 Updated 11 months ago
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages. ☆897 Apr 10, 2026 Updated last week
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | The project has been migrated to Apa… ☆183 Apr 6, 2022 Updated 4 years ago
- Cloud Shuffle Service (CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce. ☆262 May 12, 2024 Updated last year
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments. ☆286 Feb 24, 2026 Updated last month
- Building an Enterprise-Grade Big Data Platform: Architecture and Implementation ☆11 Nov 13, 2019 Updated 6 years ago
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines. ☆1,547 Apr 11, 2026 Updated last week
- MCP server for Apache Gravitino ☆21 Jul 3, 2025 Updated 9 months ago