ZuInnoTe / spark-hadoopoffice-ds
A Spark datasource for the HadoopOffice library
☆36Sep 29, 2025Updated 4 months ago
Alternatives and similar repositories for spark-hadoopoffice-ds
Users that are interested in spark-hadoopoffice-ds are comparing it to the libraries listed below
- HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)☆62Sep 29, 2025Updated 4 months ago
- A Spark datasource for the HadoopCryptoLedger library☆13Sep 29, 2025Updated 4 months ago
- Connect your Spark Databricks clusters Log4J output to the Application Insights Appender☆19Aug 4, 2020Updated 5 years ago
- Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and unde…☆16Oct 1, 2022Updated 3 years ago
- A Spark plugin for reading and writing Excel files☆520Updated this week
- ACID Data Source for Apache Spark based on Hive ACID☆96Jul 7, 2021Updated 4 years ago
- HA, fault-tolerant, non-intrusive INotify for Hadoop HDFS☆18Apr 16, 2023Updated 2 years ago
- Hadoop utility to compact small files☆18Mar 5, 2024Updated last year
- Collection of HDP Tuning Tricks & Tips (unofficial guide)☆17Sep 26, 2017Updated 8 years ago
- ☆38Feb 28, 2018Updated 7 years ago
- Apache Spark ETL Utilities☆39Oct 23, 2024Updated last year
- A Spark connector for the Azure Common Data Model☆15May 31, 2023Updated 2 years ago
- Splittable SAS (.sas7bdat) Input Format for Hadoop and Spark SQL☆97Jan 22, 2026Updated 3 weeks ago
- A toolset to streamline running spark python on EMR☆20Nov 16, 2016Updated 9 years ago
- DataQuality for BigData☆147Dec 15, 2023Updated 2 years ago
- ☆20Feb 28, 2018Updated 7 years ago
- Convert a CSV file to ORCFile☆26Apr 10, 2019Updated 6 years ago
- ☆30Apr 6, 2025Updated 10 months ago
- Terraform modules for provisioning and managing AWS Glue resources☆34Dec 10, 2025Updated 2 months ago
- A project to create a stub/mock environment for testing ExecuteScript processors☆30Aug 10, 2018Updated 7 years ago
- Storage Benchmark Kit☆33Nov 5, 2025Updated 3 months ago
- Linux open by handle based VFS implementation for nfs4j☆13Oct 24, 2025Updated 3 months ago
- Advanced block device testing/file system testing, targeting SNIA-compatible reporting☆12Oct 15, 2025Updated 4 months ago
- Adelic p-adic Dark Matter☆13Dec 28, 2025Updated last month
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆73Mar 14, 2021Updated 4 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆91Mar 5, 2024Updated last year
- This repository is to help with the Partner Demonstration of the Apache Atlas project.☆30Oct 29, 2015Updated 10 years ago
- On-the-fly translation of Spark programs to run natively on your Oracle DB. Your Spark programs require no changes.☆35Apr 15, 2025Updated 10 months ago
- Windows 10 driver for the Mixman DM2 USB turntable☆14Mar 6, 2023Updated 2 years ago
- ☆13Nov 10, 2025Updated 3 months ago
- I'll munch some data here☆12Jun 18, 2021Updated 4 years ago
- A typing game for children aged 2 to 6☆10May 29, 2020Updated 5 years ago
- Lustre Repository with MS patches☆13Updated this week
- Python wrappers for the FirecREST API☆12Dec 23, 2025Updated last month
- Spark Structured Streaming State Tools☆34Jul 3, 2020Updated 5 years ago
- HDFS Shell is an HDFS manipulation tool to work with functions integrated in Hadoop DFS☆152Sep 11, 2023Updated 2 years ago
- Java event logs collector for hadoop and frameworks☆41Mar 25, 2025Updated 10 months ago
- [Prototype exploration] Uses MPEG-DASH SRD to transmit and render only the panoramic video tiles within the user's viewing FOV. Powered by dash-srd.js☆10Oct 6, 2020Updated 5 years ago
- Auto detection of apt proxies in the LAN, caching and checking status☆10Feb 13, 2025Updated last year