ZuInnoTe/spark-hadoopoffice-ds

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ZuInnoTe/spark-hadoopoffice-ds)

ZuInnoTe / spark-hadoopoffice-ds

A Spark datasource for the HadoopOffice library

☆36

Alternatives and similar repositories for spark-hadoopoffice-ds

Users that are interested in spark-hadoopoffice-ds are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ZuInnoTe / hadoopoffice
View on GitHub
HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
☆63Sep 29, 2025Updated 10 months ago
ZuInnoTe / spark-hadoopcryptoledger-ds
View on GitHub
A Spark datasource for the HadoopCryptoLedger library
☆13Sep 29, 2025Updated 10 months ago
mayur2810 / sope
View on GitHub
Apache Spark ETL Utilities
☆40Oct 23, 2024Updated last year
AdamPaternostro / Azure-Databricks-Log4J-To-AppInsights
View on GitHub
Connect your Spark Databricks clusters Log4J output to the Application Insights Appender
☆19Aug 4, 2020Updated 5 years ago
nightscape / spark-excel
View on GitHub
A Spark plugin for reading and writing Excel files
☆523Jul 20, 2026Updated last week
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
luseiee / machineLearningWithSpark
View on GitHub
Spark projects. Learning book "Machine Learning with Spark"
☆10Jun 3, 2017Updated 9 years ago
bernhard-42 / pyspark-atlas
View on GitHub
PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection
☆17Jan 12, 2017Updated 9 years ago
drbh / ipfs-pubsub-compute
View on GitHub
An easy way to run code on other machines using IPFS Pubsub as the message queue, AWS's Python3.7 Lambda Docker Container for execution
☆10May 26, 2019Updated 7 years ago
yodasco / pyspark-emr
View on GitHub
A toolset to streamline running spark python on EMR
☆20Nov 16, 2016Updated 9 years ago
llnl / spark-hdf5
View on GitHub
A plugin to enable Apache Spark to read HDF5 files
☆20Nov 17, 2016Updated 9 years ago
w3c-cg / holon
View on GitHub
☆17Updated this week
danlessa / TheGraph
View on GitHub
☆13Nov 10, 2022Updated 3 years ago
ebonnal / delta-lake-ui
View on GitHub
[student project] UI to run SQL on Delta Lake tables and visualize the variations of the result among tables versions
☆12Apr 21, 2020Updated 6 years ago
MRobalinho / Object-Detection-Video
View on GitHub
Object Detection Video with TensorFlow
☆14Nov 17, 2018Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
caroljmcdonald / spark-ml-kmeans-uber
View on GitHub
☆38Feb 28, 2018Updated 8 years ago
qubole / spark-acid
View on GitHub
ACID Data Source for Apache Spark based on Hive ACID
☆97Jul 7, 2021Updated 5 years ago
saurfang / spark-sas7bdat
View on GitHub
Splittable SAS (.sas7bdat) Input Format for Hadoop and Spark SQL
☆99Jan 22, 2026Updated 6 months ago
FINRAOS / herd-ui
View on GitHub
⚠️ Archived — This repository is no longer maintained and will not receive updates, including security patches. It is preserved in read-o…
☆16Updated this week
zaratsian / HDP_Tuning_Unofficial
View on GitHub
Collection of HDP Tuning Tricks & Tips (unofficial guide)
☆17Sep 26, 2017Updated 8 years ago
Atheuz / Test-API
View on GitHub
Test API using Fast API library.
☆14Apr 10, 2022Updated 4 years ago
manerfan / SnowFlakeWithZK
View on GitHub
分布式自增ID/发号器 SnowFlake with Zookeeper in Kotlin
☆17Jun 22, 2018Updated 8 years ago
tina437213 / spark-bulkload-hbase-spring-boot-rest
View on GitHub
This project compose of two parts: 1) write, spark job to write to hbase using bulk load to; 2)read, rest api reading from hbase base on …
☆20Oct 25, 2017Updated 8 years ago
Azure / azure-kusto-spark
View on GitHub
Apache Spark Connector for Azure Kusto
☆81Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
openSUSE / microos-tools
View on GitHub
Files and scripts for the SUSE MicroOS part
☆17Mar 9, 2026Updated 4 months ago
ClaudeBot / hubot-remind-advanced
View on GitHub
A Hubot script for creating quick reminders through natural language.
☆11Jun 29, 2017Updated 9 years ago
richardswinbank / community
View on GitHub
☆30Apr 6, 2025Updated last year
wenbo2018 / WebS
View on GitHub
WebS is a lightweight MVC framework
☆11Dec 6, 2017Updated 8 years ago
LinkedInAttic / apache-incubator-gobblin
View on GitHub
Gobblin is a distributed big data integration framework (ingestion, replication, compliance, retention) for batch and streaming systems.…
☆11Jul 29, 2017Updated 9 years ago
yaooqinn / spark-ranger
View on GitHub
已经合入(apache/incubator-kyuubi) ACL Management for Apache Spark SQL with Apache Ranger.
☆59Nov 11, 2021Updated 4 years ago
Azure / spark-cdm
View on GitHub
A Spark connector for the Azure Common Data Model
☆15May 31, 2023Updated 3 years ago
kamon-io / kamon-netty
View on GitHub
kamon netty integration
☆10Aug 30, 2020Updated 5 years ago
Liberxue / OmniGraffle-Pro-template
View on GitHub
OmniGraffle free templates
☆30Mar 16, 2021Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
kmgowda / SBK
View on GitHub
Storage Benchmark Kit
☆35Updated this week
fscm / terraform-module-aws-zookeeper
View on GitHub
Terraform Module to create a Apache Zookeeper cluster on AWS
☆13Jan 3, 2022Updated 4 years ago
hortonworks-gallery / ambari-freeipa-service
View on GitHub
Ambari service for RedHat FreeIPA
☆11Sep 30, 2016Updated 9 years ago
mstump / cassandra_range_repair
View on GitHub
python script to repair the primary range of a node in N discrete steps
☆12Aug 3, 2018Updated 7 years ago
cevoaustralia / glue-vscode
View on GitHub
Local Development of AWS Glue with Docker and Visual Studio Code
☆14Nov 29, 2021Updated 4 years ago
amazon-archives / amazon-cognito-streams-sample
View on GitHub
Sample demonstrating consuming Amazon Cognito Streams
☆10Jun 15, 2020Updated 6 years ago
oleewere / ansible-ambari-manager
View on GitHub
List of playbooks to manage Ambari
☆13Oct 3, 2018Updated 7 years ago