HDFS file management tool: batch file decompression, compression, and small-file merging
☆25Feb 2, 2024Updated 2 years ago
Alternatives and similar repositories for hdfsutils
Users who are interested in hdfsutils are comparing it to the libraries listed below.
- Merge Small files for Hive Table on HDFS☆15Mar 4, 2014Updated 12 years ago
- Hadoop utility to compact small files☆18Feb 16, 2026Updated 2 months ago
- Loads messages from Kafka 0.8.2 into HDFS using the simple consumer and a custom MapReduce job☆12Aug 12, 2015Updated 10 years ago
- A simple TCP/IP protocol stack supporting TCP and UDP, dynamic IP acquisition via DHCP, keep_alive, and more☆24Mar 30, 2018Updated 8 years ago
- Spark(multi versions) + Streaming/Hive/SQL/UDF Demos☆15May 17, 2018Updated 7 years ago
- Automatically refreshes Impala metadata, for users on Impala versions below 3.2, which lack automatic metadata refresh☆11Jul 27, 2021Updated 4 years ago
- ☆10Feb 20, 2021Updated 5 years ago
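- ExcelGPT is an intelligent spreadsheet-processing plugin. It offers a full set of spreadsheet features, from formula explanation, formula generation, data generation, data analysis, text polishing and rewriting, and Chinese-English translation to an intelligent Q&A chat assistant, lowering the learning cost and barrier of spreadsheet work so you can focus your time and energy on creating value.☆15Jul 28, 2024Updated last year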
- ☆28Sep 18, 2019Updated 6 years ago
- Code accompanying the paper "Semi-Unsupervised Learning with Deep Generative Models: Clustering and Classifying using Ultra-Sparse Labels…☆13Jan 25, 2019Updated 7 years ago
- An online network-drive system with distributed storage☆24Dec 16, 2022Updated 3 years ago
- Forgetful Bloom filters☆16Mar 8, 2019Updated 7 years ago
- High-performance real-time big data synchronization: Kafka connector (kafka-connect-kudu-sink plugin), massive log stream processing☆19Jun 17, 2022Updated 3 years ago
- webGame☆13Aug 20, 2012Updated 13 years ago
- Code for Semi-crowdsourced Clustering with Deep Generative Models☆12Dec 9, 2022Updated 3 years ago
- Spectral Clustering in C++☆17Jan 8, 2013Updated 13 years ago
- Convolutional Embedded Networks for Population Scale Clustering and Bio-ancestry Inferencing☆11Jan 7, 2020Updated 6 years ago
- MIT dsail research project☆12May 14, 2020Updated 5 years ago
- Rule engine (Drools examples, MVEL-based rule engine)☆29Jun 17, 2022Updated 3 years ago
- General-purpose data generation platform☆13Mar 11, 2025Updated last year
- ☆12Jun 21, 2022Updated 3 years ago
- Selective Sampling-based Scalable Sparse Subspace Clustering (NeurIPS 19')☆13Jul 19, 2020Updated 5 years ago
- This project keeps the patches that were submitted to the open-source community☆16Oct 22, 2017Updated 8 years ago
- ☆16Feb 19, 2017Updated 9 years ago
- Code for the paper "Feature Grouping as a Stochastic Regularizer for High-Dimensional Structured Data" at ICML 2019.☆20Apr 22, 2019Updated 6 years ago
- A TensorFlow implementation of Deep Clustering Network (DCN), ICML 2017☆13Oct 21, 2022Updated 3 years ago
- typical zookeeper application☆17Aug 30, 2018Updated 7 years ago
- The BTL C/C++ Common bloom filters for bioinformatics projects, as well as any APIs created for other programming languages.☆18Feb 26, 2022Updated 4 years ago
- A note-taking tool similar to Yuque.☆10Jan 21, 2025Updated last year
- Flink SQL tutorial☆35Dec 2, 2024Updated last year
- mysql to starrocks|doris sync☆27Dec 19, 2024Updated last year
- ☆19Dec 16, 2022Updated 3 years ago
- BC-Tree and Ball-Tree for Point-to-Hyperplane NNS (ICDE 2023)☆17Aug 4, 2023Updated 2 years ago
- ☆25Oct 18, 2021Updated 4 years ago
- Akka in Practice☆20Apr 11, 2017Updated 9 years ago
- Flink + MyBatis + generic Mapper integration☆13May 13, 2019Updated 6 years ago
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino☆22May 30, 2022Updated 3 years ago
- FITing Tree is an indexing data structure that efficiently uses the memory without sacrificing the performance. For the paper: https://dl…☆13Nov 21, 2021Updated 4 years ago
- A Dubbo demo shared with teammates; see the documents in the doc directory for details☆14Jun 26, 2015Updated 10 years ago