A Spark datasource for the HadoopOffice library
☆36Sep 29, 2025Updated 6 months ago
Alternatives and similar repositories for spark-hadoopoffice-ds
Users that are interested in spark-hadoopoffice-ds are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)☆62Sep 29, 2025Updated 6 months ago
- A Spark datasource for the HadoopCryptoLedger library☆13Sep 29, 2025Updated 6 months ago
- Apache Spark ETL Utilities☆39Oct 23, 2024Updated last year
- Connect your Spark Databricks clusters Log4J output to the Application Insights Appender☆19Aug 4, 2020Updated 5 years ago
- A Spark plugin for reading and writing Excel files☆522Mar 23, 2026Updated last week
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Microservices with spring-boot and Machine Learning with Apache Spark ML☆13Sep 15, 2018Updated 7 years ago
- Spark projects. Learning book "Machine Learning with Spark"☆10Jun 3, 2017Updated 8 years ago
- An easy way to run code on other machines using IPFS Pubsub as the message queue, AWS's Python3.7 Lambda Docker Container for execution☆10May 26, 2019Updated 6 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆17Jan 12, 2017Updated 9 years ago
- A plugin to enable Apache Spark to read HDF5 files☆20Nov 17, 2016Updated 9 years ago
- A toolset to streamline running spark python on EMR☆20Nov 16, 2016Updated 9 years ago
- Repository for Complex Systems model of the Grassroots Economics Community Inclusion Currency project.☆11Jan 29, 2023Updated 3 years ago
- [student project] UI to run SQL on Delta Lake tables and visualize the variations of the result among tables versions☆12Apr 21, 2020Updated 5 years ago
- ACID Data Source for Apache Spark based on Hive ACID☆96Jul 7, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and unde…☆16Oct 1, 2022Updated 3 years ago
- Collection of HDP Tuning Tricks & Tips (unofficial guide)☆17Sep 26, 2017Updated 8 years ago
- Tools for improving cadCAD experience☆11Dec 22, 2023Updated 2 years ago
- ☆38Feb 28, 2018Updated 8 years ago
- A Hubot script for creating quick reminders through natural language.☆11Jun 29, 2017Updated 8 years ago
- R Package for WebHDFS REST API☆18Apr 15, 2019Updated 6 years ago
- Files and scripts for the SUSE MicroOS part☆17Mar 9, 2026Updated 2 weeks ago
- auth_os beta platform build☆19Aug 20, 2018Updated 7 years ago
- The setup repository is part of the Corporate Linked Data Catalog - short: COLID - application. It helps setting up a local environment …☆16Dec 17, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Interactive playing with Math in Scala☆10Jan 4, 2017Updated 9 years ago
- Gobblin is a distributed big data integration framework (ingestion, replication, compliance, retention) for batch and streaming systems.…☆11Jul 29, 2017Updated 8 years ago
- WebS is a lightweight MVC framework☆11Dec 6, 2017Updated 8 years ago
- ☆30Apr 6, 2025Updated 11 months ago
- 已经合入(apache/incubator-kyuubi) ACL Management for Apache Spark SQL with Apache Ranger.☆58Nov 11, 2021Updated 4 years ago
- Scrapy exporter for Big Data formats☆16Mar 10, 2026Updated 2 weeks ago
- Terraform Module to create a Apache Zookeeper cluster on AWS☆13Jan 3, 2022Updated 4 years ago
- A Spark connector for the Azure Common Data Model☆15May 31, 2023Updated 2 years ago
- kamon netty integration☆10Aug 30, 2020Updated 5 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Ambari service for RedHat FreeIPA☆11Sep 30, 2016Updated 9 years ago
- python script to repair the primary range of a node in N discrete steps☆12Aug 3, 2018Updated 7 years ago
- Hadoop FSImage Analyzer (HFSA)☆67Mar 17, 2026Updated last week
- ☆10Jul 5, 2016Updated 9 years ago
- List of playbooks to manage Ambari☆13Oct 3, 2018Updated 7 years ago
- Grafana Prometheus exporter☆10Oct 17, 2017Updated 8 years ago
- Extract data from SAP applications using Operational Data Provisioning☆10Jul 19, 2023Updated 2 years ago