A Spark datasource for the HadoopOffice library
☆36Sep 29, 2025Updated 6 months ago
Alternatives and similar repositories for spark-hadoopoffice-ds
Users that are interested in spark-hadoopoffice-ds are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)☆62Sep 29, 2025Updated 6 months ago
- Apache Spark ETL Utilities☆39Oct 23, 2024Updated last year
- Connect your Spark Databricks clusters Log4J output to the Application Insights Appender☆19Aug 4, 2020Updated 5 years ago
- A Spark plugin for reading and writing Excel files☆523Updated this week
- Microservices with spring-boot and Machine Learning with Apache Spark ML☆13Sep 15, 2018Updated 7 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Spark projects. Learning book "Machine Learning with Spark"☆10Jun 3, 2017Updated 8 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆17Jan 12, 2017Updated 9 years ago
- A plugin to enable Apache Spark to read HDF5 files☆20Nov 17, 2016Updated 9 years ago
- Object Detection Video with TensorFlow☆13Nov 17, 2018Updated 7 years ago
- [student project] UI to run SQL on Delta Lake tables and visualize the variations of the result among tables versions☆12Apr 21, 2020Updated 5 years ago
- ACID Data Source for Apache Spark based on Hive ACID☆96Jul 7, 2021Updated 4 years ago
- Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and unde…☆16Oct 1, 2022Updated 3 years ago
- Collection of HDP Tuning Tricks & Tips (unofficial guide)☆17Sep 26, 2017Updated 8 years ago
- Verify that all reachable code links and will not fail at runtime with a linkage error☆10Jul 30, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Test API using Fast API library.☆14Apr 10, 2022Updated 4 years ago
- Examples and Quick Starts for Snowflake☆11Apr 4, 2026Updated 2 weeks ago
- A Hubot script for creating quick reminders through natural language.☆11Jun 29, 2017Updated 8 years ago
- R Package for WebHDFS REST API☆18Apr 15, 2019Updated 7 years ago
- Apache Spark Connector for Azure Kusto☆81Mar 30, 2026Updated 2 weeks ago
- The setup repository is part of the Corporate Linked Data Catalog - short: COLID - application. It helps setting up a local environment …☆16Dec 17, 2024Updated last year
- Finetuning and Inference of Llama2 7b model on colab☆14Jul 19, 2023Updated 2 years ago
- Interactive playing with Math in Scala☆10Jan 4, 2017Updated 9 years ago
- Gobblin is a distributed big data integration framework (ingestion, replication, compliance, retention) for batch and streaming systems.…☆11Jul 29, 2017Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- WebS is a lightweight MVC framework☆11Dec 6, 2017Updated 8 years ago
- ☆30Apr 6, 2025Updated last year
- Ambari service for RedHat FreeIPA☆11Sep 30, 2016Updated 9 years ago
- ☆10Jul 5, 2016Updated 9 years ago
- List of playbooks to manage Ambari☆13Oct 3, 2018Updated 7 years ago
- Grafana Prometheus exporter☆10Oct 17, 2017Updated 8 years ago
- Extract data from SAP applications using Operational Data Provisioning☆10Jul 19, 2023Updated 2 years ago
- Local Development of AWS Glue with Docker and Visual Studio Code☆14Nov 29, 2021Updated 4 years ago
- Sample demonstrating consuming Amazon Cognito Streams☆10Jun 15, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Avro Schema Shredder is a REST API that enables storage of Avro Schemas in Apache Atlas. This API enables an organization to use Apache A…☆13Jan 11, 2017Updated 9 years ago
- Sample processing code using Spark 2.1+ and Scala☆51Jun 28, 2020Updated 5 years ago
- ☆13Sep 5, 2023Updated 2 years ago
- Data Quality Monitoring Tool☆15Dec 5, 2017Updated 8 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆73Mar 14, 2021Updated 5 years ago
- Sample Docker Compose files for running Apache Ambari☆11Oct 29, 2018Updated 7 years ago
- Storage Benchmark Kit☆33Mar 19, 2026Updated last month