A Spark datasource for the HadoopOffice library
☆36Sep 29, 2025Updated 8 months ago
Alternatives and similar repositories for spark-hadoopoffice-ds
Users that are interested in spark-hadoopoffice-ds are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)☆63Sep 29, 2025Updated 8 months ago
- A Spark datasource for the HadoopCryptoLedger library☆13Sep 29, 2025Updated 8 months ago
- Apache Spark ETL Utilities☆39Oct 23, 2024Updated last year
- Connect your Spark Databricks clusters Log4J output to the Application Insights Appender☆19Aug 4, 2020Updated 5 years ago
- A Spark plugin for reading and writing Excel files☆522May 13, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Microservices with spring-boot and Machine Learning with Apache Spark ML☆13Sep 15, 2018Updated 7 years ago
- An easy way to run code on other machines using IPFS Pubsub as the message queue, AWS's Python3.7 Lambda Docker Container for execution☆10May 26, 2019Updated 7 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆17Jan 12, 2017Updated 9 years ago
- A plugin to enable Apache Spark to read HDF5 files☆20Nov 17, 2016Updated 9 years ago
- A toolset to streamline running spark python on EMR☆20Nov 16, 2016Updated 9 years ago
- Repository for Complex Systems model of the Grassroots Economics Community Inclusion Currency project.☆11May 15, 2026Updated 2 weeks ago
- ☆13Nov 10, 2022Updated 3 years ago
- [student project] UI to run SQL on Delta Lake tables and visualize the variations of the result among tables versions☆12Apr 21, 2020Updated 6 years ago
- ACID Data Source for Apache Spark based on Hive ACID☆97Jul 7, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and unde…☆16Oct 1, 2022Updated 3 years ago
- Collection of HDP Tuning Tricks & Tips (unofficial guide)☆17Sep 26, 2017Updated 8 years ago
- Verify that all reachable code links and will not fail at runtime with a linkage error☆10Jul 30, 2022Updated 3 years ago
- Splittable SAS (.sas7bdat) Input Format for Hadoop and Spark SQL☆97Jan 22, 2026Updated 4 months ago
- Test API using Fast API library.☆14Apr 10, 2022Updated 4 years ago
- A Hubot script for creating quick reminders through natural language.☆11Jun 29, 2017Updated 8 years ago
- R Package for WebHDFS REST API☆18Apr 15, 2019Updated 7 years ago
- This project compose of two parts: 1) write, spark job to write to hbase using bulk load to; 2)read, rest api reading from hbase base on …☆20Oct 25, 2017Updated 8 years ago
- Apache Spark Connector for Azure Kusto☆81May 12, 2026Updated 2 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Finetuning and Inference of Llama2 7b model on colab☆14Jul 19, 2023Updated 2 years ago
- Interactive playing with Math in Scala☆10Jan 4, 2017Updated 9 years ago
- Gobblin is a distributed big data integration framework (ingestion, replication, compliance, retention) for batch and streaming systems.…☆11Jul 29, 2017Updated 8 years ago
- Example for a Gradle project for LEGO Mindstorm EV3 and Lejos☆13Nov 28, 2021Updated 4 years ago
- A repository of strategies that can be used to automate intra day trades in the National Stock Exchange using the KiteConnect API by Zero…☆16Apr 19, 2021Updated 5 years ago
- WebS is a lightweight MVC framework☆11Dec 6, 2017Updated 8 years ago
- ☆30Apr 6, 2025Updated last year
- Hadoop, MapReduce, HDFS, Spark, Pig, Hive, HBase, MongoDB, Cassandra, Flume - the list goes on! Over 25 technologies.☆10Jan 1, 2018Updated 8 years ago
- Scrapy exporter for Big Data formats☆16Mar 10, 2026Updated 2 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- 已经合入(apache/incubator-kyuubi) ACL Management for Apache Spark SQL with Apache Ranger.☆58Nov 11, 2021Updated 4 years ago
- Terraform Module to create a Apache Zookeeper cluster on AWS☆13Jan 3, 2022Updated 4 years ago
- A Spark connector for the Azure Common Data Model☆15May 31, 2023Updated 2 years ago
- Ambari service for RedHat FreeIPA☆11Sep 30, 2016Updated 9 years ago
- python script to repair the primary range of a node in N discrete steps☆12Aug 3, 2018Updated 7 years ago
- ☆10Jul 5, 2016Updated 9 years ago
- ☆19May 15, 2026Updated 2 weeks ago