A Spark datasource for the HadoopOffice library
☆36Sep 29, 2025Updated 5 months ago
Alternatives and similar repositories for spark-hadoopoffice-ds
Users that are interested in spark-hadoopoffice-ds are comparing it to the libraries listed below
Sorting:
- A Spark datasource for the HadoopCryptoLedger library☆13Sep 29, 2025Updated 5 months ago
- Connect your Spark Databricks clusters Log4J output to the Application Insights Appender☆19Aug 4, 2020Updated 5 years ago
- Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and unde…☆16Oct 1, 2022Updated 3 years ago
- A Spark plugin for reading and writing Excel files☆520Feb 12, 2026Updated 3 weeks ago
- ACID Data Source for Apache Spark based on Hive ACID☆96Jul 7, 2021Updated 4 years ago
- HA, fault-tolerant, non-intrusive INotify for Hadoop HDFS☆18Apr 16, 2023Updated 2 years ago
- Collection of HDP Tuning Tricks & Tips (unofficial guide)☆17Sep 26, 2017Updated 8 years ago
- Hadoop utility to compact small files☆18Feb 16, 2026Updated 3 weeks ago
- ☆38Feb 28, 2018Updated 8 years ago
- Apache Spark ETL Utilities☆39Oct 23, 2024Updated last year
- A Spark connector for the Azure Common Data Model☆15May 31, 2023Updated 2 years ago
- Apache Spark Connector for Azure Kusto☆79Updated this week
- Marquez Web UI☆21Nov 13, 2020Updated 5 years ago
- DataQuality for BigData☆148Dec 15, 2023Updated 2 years ago
- Hadoop FSImage Analyzer (HFSA)☆66Mar 2, 2026Updated last week
- Convert a CSV fle to ORCFile☆26Apr 10, 2019Updated 6 years ago
- A project to create a stub/mock environment for testing ExecuteScript processors☆31Aug 10, 2018Updated 7 years ago
- Terraform modules for provisioning and managing AWS Glue resources☆34Dec 10, 2025Updated 2 months ago
- Storage Benchmark Kit☆33Nov 5, 2025Updated 4 months ago
- Adelic p-adic Dark Matter☆13Feb 15, 2026Updated 3 weeks ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆73Mar 14, 2021Updated 4 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆92Mar 5, 2024Updated 2 years ago
- On the fly, translation of Spark programs to run natively on your Oracle DB. Your Spark programs require no changes.☆35Apr 15, 2025Updated 10 months ago
- 适合2到6岁的宝宝打字游戏☆10May 29, 2020Updated 5 years ago
- Lustre Repository with MS patches☆14Updated this week
- ☆14Nov 10, 2025Updated 3 months ago
- Windows 10 driver for the Mixman DM2 USB turntable☆14Mar 6, 2023Updated 3 years ago
- Use Terraform outputs in your ruby code.☆11Feb 5, 2020Updated 6 years ago
- Spark Structured Streaming State Tools☆34Jul 3, 2020Updated 5 years ago
- HDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS☆152Sep 11, 2023Updated 2 years ago
- Java event logs collector for hadoop and frameworks☆41Mar 25, 2025Updated 11 months ago
- Pipeline for Recombinant Yeast genoMEs That Identifies Markers of Engineering☆12May 13, 2024Updated last year
- Local Development of AWS Glue with Docker and Visual Studio Code☆14Nov 29, 2021Updated 4 years ago
- 【原型探索】基于MPEG-DASH的SRD,实现只传输和渲染用户观看FOV区域中的全景视频分块。Powered By dash-srd.js☆10Oct 6, 2020Updated 5 years ago
- A Scala library for locality sensitive hashing☆14Aug 1, 2018Updated 7 years ago
- Everything which has to do with Data Integration. Templates for Azure Data Factory and Azure Synapse Analytics☆10Jan 29, 2022Updated 4 years ago
- Repo to hold code Artifacts for WAF☆10Sep 14, 2022Updated 3 years ago
- Lustre HSM tools☆10Feb 19, 2024Updated 2 years ago
- ☆12Apr 27, 2018Updated 7 years ago