Windows binaries for Hadoop versions (built from the git commit ID used for the ASF release)
☆2,640 · Sep 29, 2023 · Updated 2 years ago
Alternatives and similar repositories for winutils
Users that are interested in winutils are comparing it to the libraries listed below.
- winutils.exe, hadoop.dll, and hdfs.dll binaries for Hadoop on Windows ☆2,252 · May 16, 2024 · Updated last year
- hadoop-common-2.2.0/bin ☆365 · Aug 30, 2015 · Updated 10 years ago
- Apache Spark - A unified analytics engine for large-scale data processing ☆43,236 · Updated this week
- The Apache Spark - Apache HBase Connector is a library that lets Spark access HBase tables as an external data source or sink. ☆547 · May 10, 2021 · Updated 4 years ago
- winutils.exe, hadoop.dll, and hdfs.dll binaries for Hadoop on Windows ☆293 · Dec 11, 2022 · Updated 3 years ago
- Apache Flink ☆25,980 · Updated this week
- Cool Play Spark: Spark source code analysis, Spark libraries, and more ☆3,479 · May 18, 2022 · Updated 3 years ago
- DataX is the open-source version of Alibaba Cloud DataWorks Data Integration. ☆17,190 · Jul 1, 2025 · Updated 10 months ago
- Eclipse plugin for Hadoop 2.2.0, 2.4.1 ☆558 · Jan 24, 2019 · Updated 7 years ago
- Windows binaries and winutils for Hadoop 3.1.1 ☆33 · Sep 20, 2018 · Updated 7 years ago
- Notes on the design and implementation of Apache Spark ☆5,367 · Apr 2, 2024 · Updated 2 years ago
- Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code ☆14,252 · Updated this week
- Apache Hive ☆5,968 · Updated this week
- A data integration framework ☆4,108 · Dec 2, 2025 · Updated 5 months ago
- SeaTunnel is a multimodal, high-performance, distributed, massive data integration tool. ☆9,302 · Apr 30, 2026 · Updated last week
- Flink learning blog. http://www.54tianzhisheng.cn/ Covers getting started with Flink, concepts, principles, hands-on practice, performance tuning, source code analysis, and more. Touches on Flink Connector, Metrics, Library, DataStream API, Ta… ☆15,052 · Apr 14, 2026 · Updated 3 weeks ago
- Chinese edition of the official Apache Spark documentation ☆1,180 · Jul 21, 2023 · Updated 2 years ago
- Produced by the Alibaba Cloud computing platform DataWorks (https://help.aliyun.com/document_detail/137663.html) team: a database connection pool built for monitoring ☆28,196 · Apr 22, 2026 · Updated 2 weeks ago
- Usage of the various Hadoop components, continuously updated ☆900 · Jan 4, 2023 · Updated 3 years ago
- Azkaban workflow manager. ☆4,511 · Jul 3, 2024 · Updated last year
- Apache Ambari simplifies provisioning, managing, and monitoring of Apache Hadoop clusters. ☆2,301 · Feb 20, 2026 · Updated 2 months ago
- Apache Hadoop ☆15,533 · Updated this week
- Extends the real-time SQL of open-source Flink; mainly implements joins between streams and dimension tables, and supports all native Flink SQL syntax ☆2,055 · Feb 21, 2024 · Updated 2 years ago
- Flink CDC is a streaming data integration tool ☆6,415 · Updated this week
- Alibaba's MySQL binlog incremental subscription and consumption component ☆29,679 · Apr 29, 2026 · Updated last week
- Pentaho Data Integration (ETL), a.k.a. Kettle ☆8,341 · Updated this week
- Upserts, Deletes And Incremental Processing on Big Data. ☆6,150 · Updated this week
- CMAK is a tool for managing Apache Kafka clusters ☆11,938 · Aug 2, 2023 · Updated 2 years ago
- An AI-driven, distributed, high-performance monitoring system for comprehensive monitoring and management of Kafka clusters. ☆3,177 · Dec 18, 2025 · Updated 4 months ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses. ☆2,327 · Apr 28, 2026 · Updated last week
- Windows binaries for Hadoop versions (built from the git commit ID used for the ASF release) ☆121 · Dec 20, 2017 · Updated 8 years ago
- Focused on big data learning and interviews; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive... ☆10,457 · Aug 7, 2023 · Updated 2 years ago
- Hadoop 2.7.1 on Windows ☆86 · May 29, 2018 · Updated 7 years ago
- A visual web interface integrated with DataX: select a data source to generate a data synchronization task in one click. Supports data sources such as RDBMS, Hive, HBase, ClickHouse, and MongoDB; batch creation of RDBMS data synchronization tasks; integrates an open-source scheduling system; supports distributed operation, incremental data synchronization, real-time viewing of run logs, monitoring of executor resources, killing running processes, … ☆5,993 · Jun 2, 2024 · Updated last year
- A web front end for an Elasticsearch cluster ☆9,486 · Jul 17, 2021 · Updated 4 years ago
- Redis is an in-memory database that persists on disk. The data model is key-value, but many different kinds of values are supported: Strin… ☆21,090 · Jul 18, 2019 · Updated 6 years ago
- Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation. ☆3,728 · Apr 20, 2026 · Updated 2 weeks ago
- Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White ☆3,506 · Mar 17, 2020 · Updated 6 years ago
- A distributed task scheduling framework. (Distributed task scheduling platform XXL-JOB) ☆30,119 · Apr 5, 2026 · Updated last month