HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
☆62 Sep 29, 2025 Updated 5 months ago
Alternatives and similar repositories for hadoopoffice
Users that are interested in hadoopoffice are comparing it to the libraries listed below
- A Spark datasource for the HadoopOffice library ☆36 Sep 29, 2025 Updated 5 months ago
- A Spark datasource for the HadoopCryptoLedger library ☆13 Sep 29, 2025 Updated 5 months ago
- A Spark plugin for reading and writing Excel files ☆520 Feb 12, 2026 Updated 3 weeks ago
- Automated TPC-DS and TPC-H benchmark for Apache Hive LLAP ☆10 Jul 18, 2022 Updated 3 years ago
- Cloud Spanner Connector for Apache Spark ☆17 Updated this week
- Scala API for Apache Spark SQL high-order functions ☆14 Aug 4, 2023 Updated 2 years ago
- A Spark data source for reading Microsoft Excel files ☆13 Jul 1, 2024 Updated last year
- Android's Room Persistence Library for Scala ☆12 Apr 29, 2020 Updated 5 years ago
- low-level helpers for Apache Spark libraries and tests ☆16 Dec 29, 2018 Updated 7 years ago
- Google Maps geocoding library for Scala ☆12 Oct 12, 2019 Updated 6 years ago
- Spark SQL DBF Library ☆16 Jan 2, 2015 Updated 11 years ago
- Type-safe SQL builder for Scala ☆30 Jul 18, 2019 Updated 6 years ago
- ACID Data Source for Apache Spark based on Hive ACID ☆96 Jul 7, 2021 Updated 4 years ago
- Schema Registry integration for Apache Spark ☆40 Nov 16, 2022 Updated 3 years ago
- A Spark connector for the Azure Common Data Model ☆15 May 31, 2023 Updated 2 years ago
- Generate literate-style markdown docs from your sources ☆59 Jan 30, 2018 Updated 8 years ago
- ☆34 Jan 4, 2026 Updated 2 months ago
- Deriving Spark DataFrame schemas from case classes ☆44 Jun 24, 2024 Updated last year
- Sbt plugin for fully automated releases, without SNAPSHOT and git sha's in the version. A remix of the best ideas from sbt-ci-release and… ☆21 Jul 1, 2025 Updated 8 months ago
- ☆47 Apr 21, 2021 Updated 4 years ago
- Jaqy - a Universal JDBC Client ☆21 Jun 15, 2023 Updated 2 years ago
- Utilities for writing tests that use Apache Spark. ☆24 Dec 29, 2018 Updated 7 years ago
- DataQuality for BigData ☆148 Dec 15, 2023 Updated 2 years ago
- ☆23 Feb 10, 2019 Updated 7 years ago
- Make your joins typesafe again ☆26 Feb 5, 2026 Updated last month
- Connect your Spark Databricks clusters Log4J output to the Application Insights Appender ☆19 Aug 4, 2020 Updated 5 years ago
- Eval evaluates Scala 3 code. Why parse JSON when you can load case classes? ☆24 Jan 23, 2026 Updated last month
- Simple implementations of forward- and backward-mode automatic differentiation in Scala ☆23 Jun 21, 2018 Updated 7 years ago
- An example of building kubernetes operator (Flink) using Abstract operator's framework ☆26 Jul 12, 2019 Updated 6 years ago
- Immutable DataTable implementation in Scala ☆70 Dec 30, 2019 Updated 6 years ago
- Scala-friendly, fast class-finder library (using ASM under the covers) ☆94 Jul 3, 2021 Updated 4 years ago
- A native CLI in Scala to quickly move through the filesystem ☆21 Mar 15, 2020 Updated 5 years ago
- Type-safe, high performance, distributed Neural networks in Scala ☆29 Nov 20, 2023 Updated 2 years ago
- A project to create a stub/mock environment for testing ExecuteScript processors ☆31 Aug 10, 2018 Updated 7 years ago
- ScalaCheck for Spark ☆63 Apr 2, 2018 Updated 7 years ago
- Discover java object sizes through questionable sleuthing plus luck. ☆70 Jul 16, 2018 Updated 7 years ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines ☆122 Updated this week
- Manage your Watcher definitions in a beautiful way. ☆11 Sep 22, 2016 Updated 9 years ago
- Linux open by handle based VFS implementation for nfs4j ☆13 Oct 24, 2025 Updated 4 months ago