aliyun / aliyun-emapreduce-datasourcesView external linksLinks
Extended datasource support for Spark/Hadoop on Aliyun E-MapReduce.
☆170Nov 30, 2023Updated 2 years ago
Alternatives and similar repositories for aliyun-emapreduce-datasources
Users that are interested in aliyun-emapreduce-datasources are comparing it to the libraries listed below
Sorting:
- ☆126Feb 2, 2026Updated 2 weeks ago
- Spark on ECS☆24Jul 4, 2015Updated 10 years ago
- SDK for open source framwork to interact with MaxCompute☆39Feb 24, 2020Updated 5 years ago
- SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.☆136Mar 6, 2023Updated 2 years ago
- ☆149Jun 12, 2025Updated 8 months ago
- TiSpark is built for running Apache Spark on top of TiDB/TiKV☆892Jan 16, 2026Updated last month
- MaxCompute spark demo for building a runnable application.☆115Feb 3, 2026Updated 2 weeks ago
- alibabacloud-jindodata☆204Feb 6, 2026Updated last week
- A S3 Shuffle plugin for Apache Spark to enable elastic scaling for generic Spark workloads.☆52Sep 17, 2025Updated 4 months ago
- Example project to show how to use Kafka from Spark Streaming with the Confluent schema registry☆11Aug 17, 2016Updated 9 years ago
- springmvc+phoenix操作hbase的web架构☆10Aug 20, 2018Updated 7 years ago
- An Extensible Data Skipping Framework☆48Jul 15, 2025Updated 7 months ago
- 一个比Spark-Parquet还快5~100倍的存储格式☆12Feb 22, 2016Updated 9 years ago
- Quickly create example test-cases to reproduce your issues for reporting for Spring-Data-Neo4j in JIRA https://jira.spring.io/browse/DATA…☆10Apr 11, 2019Updated 6 years ago
- Apache YuniKorn Scheduler Interface☆34Feb 4, 2026Updated last week
- Builds HTML from Python (recovered from local installation since original was deleted)☆11Dec 26, 2022Updated 3 years ago
- 录制Spak视频课程讲解涉及编写的源代码 https://edu.hellobi.com/course/107/overview☆13Apr 23, 2019Updated 6 years ago
- Spark Shuffle Optimization with RDMA+AEP☆30May 23, 2023Updated 2 years ago
- OpenResty Lua Utils☆15Oct 11, 2021Updated 4 years ago
- Hadoop filesystem implementation for Aliyun OSS☆13Feb 14, 2016Updated 10 years ago
- Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.☆1,038Feb 5, 2026Updated last week
- Java PMML API (legacy codebase)☆80Jun 16, 2015Updated 10 years ago
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | 项目已迁移至 Apa…☆182Apr 6, 2022Updated 3 years ago
- Mirror of Apache Hive☆33Mar 16, 2020Updated 5 years ago
- ODPS Python SDK and data analysis framework☆448Dec 9, 2025Updated 2 months ago
- A HBase datasource implementation for Spark and [MLSQL](http://www.mlsql.tech).☆15Sep 29, 2023Updated 2 years ago
- A Spark Reliability Testing Suite☆13Jan 10, 2017Updated 9 years ago
- The released version of Astro(Spark SQL on HBase) has been moved to:☆16Jul 23, 2015Updated 10 years ago
- HiBench is a big data benchmark suite.☆1,489Dec 15, 2025Updated 2 months ago
- Plugin for Presto to allow addition of user functions easily☆119Mar 31, 2021Updated 4 years ago
- A parallel implementation of factorization machines based on Spark☆75Jun 28, 2020Updated 5 years ago
- ☆33May 15, 2015Updated 10 years ago
- JVM integration for Weld☆16Sep 24, 2018Updated 7 years ago
- Import Salesforce data into Hadoop HDFS in Avro format☆23Jan 8, 2020Updated 6 years ago
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange☆130Dec 19, 2024Updated last year
- The out-of-the-box environment to for Hadoop/Spark applications☆38Jan 31, 2023Updated 3 years ago
- Apache Kylin☆3,765Dec 29, 2025Updated last month
- Spark Terasort☆121Apr 21, 2023Updated 2 years ago
- Alibaba Cloud Log Service C++ SDK☆21Apr 4, 2025Updated 10 months ago