ACID Data Source for Apache Spark based on Hive ACID
☆96Jul 7, 2021Updated 4 years ago
Alternatives and similar repositories for spark-acid
Users that are interested in spark-acid are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines☆17Jan 21, 2020Updated 6 years ago
- Rocksdb state storage implementation for Structured Streaming.☆17Oct 21, 2020Updated 5 years ago
- On the fly, translation of Spark programs to run natively on your Oracle DB. Your Spark programs require no changes.☆35Apr 15, 2025Updated last year
- ☆103Mar 23, 2020Updated 6 years ago
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Jul 9, 2025Updated 9 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.☆286Feb 24, 2026Updated last month
- Qubole Sparklens tool for performance tuning Apache Spark☆589Jun 26, 2024Updated last year
- Custom state store providers for Apache Spark☆92Feb 14, 2025Updated last year
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources☆18Jun 28, 2021Updated 4 years ago
- Delta Lake Examples☆11Apr 24, 2020Updated 5 years ago
- Prescriptive Applications over Kite and Hadoop☆12Oct 14, 2015Updated 10 years ago
- Cache File System optimized for columnar formats and object stores☆188Aug 11, 2022Updated 3 years ago
- Convert a CSV fle to ORCFile☆26Apr 10, 2019Updated 7 years ago
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.☆20Oct 11, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A simplified, lightweight ETL Framework based on Apache Spark☆588Jan 24, 2024Updated 2 years ago
- A Spark datasource for the HadoopOffice library☆36Sep 29, 2025Updated 6 months ago
- Spark Structured Streaming State Tools☆34Jul 3, 2020Updated 5 years ago
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.☆431Jan 14, 2022Updated 4 years ago
- Repository that showcases problems with Kafka rebalancing and explains how to fix them. Please visit our blog article to learn what Kafka…☆12Aug 21, 2020Updated 5 years ago
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆95May 9, 2025Updated 11 months ago
- ☆12Jun 26, 2023Updated 2 years ago
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange☆131Dec 19, 2024Updated last year
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆897Updated this week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Iceberg is a table format for large, slow-moving tabular data☆492Apr 10, 2023Updated 3 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆93Mar 5, 2024Updated 2 years ago
- Example project showing how to use Hive UDFs in Apache Spark☆55Apr 23, 2019Updated 6 years ago
- Discover Flink clusters on Hadoop YARN for Prometheus☆23Aug 5, 2020Updated 5 years ago
- Extensible SQL Lexer and Parser for Rust☆12Dec 22, 2021Updated 4 years ago
- StreamLine - Streaming Analytics☆167Aug 27, 2023Updated 2 years ago
- Apache HBase Connectors☆246Feb 13, 2026Updated 2 months ago
- hadoop-mini-clusters provides an easy way to test Hadoop projects directly in your IDE☆296Jan 2, 2023Updated 3 years ago
- Embed any webapp/website as Ambari view!☆25Feb 26, 2016Updated 10 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Spark cloud integration: tests, cloud committers and more☆20Jan 30, 2025Updated last year
- Code samples from blogs posts https://www.querifylabs.com/blog☆28Dec 13, 2021Updated 4 years ago
- ☆201Feb 18, 2026Updated last month
- Spark* Shuffle plugin for support shuffling through remote persistent memory over fabrics, which leverages the RDMA network and remote pe…☆14Sep 18, 2023Updated 2 years ago
- 已经合入(apache/incubator-kyuubi) ACL Management for Apache Spark SQL with Apache Ranger.☆58Nov 11, 2021Updated 4 years ago
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka☆29Jun 8, 2016Updated 9 years ago
- ☆63Nov 8, 2019Updated 6 years ago