ACID Data Source for Apache Spark based on Hive ACID
☆96Jul 7, 2021Updated 4 years ago
Alternatives and similar repositories for spark-acid
Users that are interested in spark-acid are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines☆17Jan 21, 2020Updated 6 years ago
- Rocksdb state storage implementation for Structured Streaming.☆17Oct 21, 2020Updated 5 years ago
- ☆29Oct 15, 2019Updated 6 years ago
- On the fly, translation of Spark programs to run natively on your Oracle DB. Your Spark programs require no changes.☆35Apr 15, 2025Updated 11 months ago
- ☆103Mar 23, 2020Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Jul 9, 2025Updated 8 months ago
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.☆285Feb 24, 2026Updated last month
- Qubole Sparklens tool for performance tuning Apache Spark☆590Jun 26, 2024Updated last year
- Custom state store providers for Apache Spark☆92Feb 14, 2025Updated last year
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources☆18Jun 28, 2021Updated 4 years ago
- Delta Lake Examples☆11Apr 24, 2020Updated 5 years ago
- Prescriptive Applications over Kite and Hadoop☆12Oct 14, 2015Updated 10 years ago
- Cache File System optimized for columnar formats and object stores☆187Aug 11, 2022Updated 3 years ago
- Convert a CSV fle to ORCFile☆26Apr 10, 2019Updated 6 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.☆20Oct 11, 2021Updated 4 years ago
- A simplified, lightweight ETL Framework based on Apache Spark☆587Jan 24, 2024Updated 2 years ago
- A Spark datasource for the HadoopOffice library☆36Sep 29, 2025Updated 5 months ago
- Spark Structured Streaming State Tools☆34Jul 3, 2020Updated 5 years ago
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.☆431Jan 14, 2022Updated 4 years ago
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆95May 9, 2025Updated 10 months ago
- ☆12Jun 26, 2023Updated 2 years ago
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange☆131Dec 19, 2024Updated last year
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆893Mar 10, 2026Updated 2 weeks ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Iceberg is a table format for large, slow-moving tabular data☆490Apr 10, 2023Updated 2 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆93Mar 5, 2024Updated 2 years ago
- Example project showing how to use Hive UDFs in Apache Spark☆55Apr 23, 2019Updated 6 years ago
- Discover Flink clusters on Hadoop YARN for Prometheus☆23Aug 5, 2020Updated 5 years ago
- StreamLine - Streaming Analytics☆166Aug 27, 2023Updated 2 years ago
- Apache HBase Connectors☆248Feb 13, 2026Updated last month
- hadoop-mini-clusters provides an easy way to test Hadoop projects directly in your IDE☆297Jan 2, 2023Updated 3 years ago
- Embed any webapp/website as Ambari view!☆25Feb 26, 2016Updated 10 years ago
- Spark cloud integration: tests, cloud committers and more☆20Jan 30, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Code samples from blogs posts https://www.querifylabs.com/blog☆28Dec 13, 2021Updated 4 years ago
- ☆202Feb 18, 2026Updated last month
- Spark* Shuffle plugin for support shuffling through remote persistent memory over fabrics, which leverages the RDMA network and remote pe…☆14Sep 18, 2023Updated 2 years ago
- 已经合入(apache/incubator-kyuubi) ACL Management for Apache Spark SQL with Apache Ranger.☆58Nov 11, 2021Updated 4 years ago
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka☆29Jun 8, 2016Updated 9 years ago
- ☆63Nov 8, 2019Updated 6 years ago
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆306Oct 30, 2025Updated 4 months ago