jcrist / hdfscm
An HDFS-backed ContentsManager implementation for Jupyter
☆12Updated last year
Alternatives and similar repositories for hdfscm
Users who are interested in hdfscm are comparing it to the libraries listed below.
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 3 years ago
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.☆37Updated 3 years ago
- Apache Calcite Adapter for Apache Kudu☆28Updated 4 months ago
- Spark Connector to read and write with Pulsar☆117Updated 3 weeks ago
- Spark Clickhouse Connector☆71Updated 5 years ago
- Hadoop FSImage Analyzer (HFSA)☆66Updated this week
- A re-implementation of Hadoop DistCP in Apache Spark☆47Updated 2 years ago
- Druid indexing plugin for using Spark in batch jobs☆101Updated 4 years ago
- An S3 Shuffle plugin for Apache Spark to enable elastic scaling for generic Spark workloads.☆50Updated 4 months ago
- (Already merged into apache/incubator-kyuubi) ACL Management for Apache Spark SQL with Apache Ranger.☆58Updated 4 years ago
- Framework for running macro benchmarks in a clustered environment☆25Updated 3 years ago
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Updated 6 months ago
- Helm Chart for lyft/flinkk8soperator☆11Updated 5 years ago
- An Extensible Data Skipping Framework☆47Updated 6 months ago
- Utilities for processing Flink checkpoints/savepoints☆75Updated 6 years ago
- Discover Flink clusters on Hadoop YARN for Prometheus☆23Updated 5 years ago
- Utility for benchmarking changes in Spark using TPC-DS workloads☆16Updated 4 years ago
- JDBC driver that converts any INSERT, UPDATE and DELETE statements into append-only INSERTs. Instead of updating rows in-place it inserts…☆82Updated 8 years ago
- Read druid segments from hadoop☆10Updated 9 years ago
- ☆67Updated last year
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange☆130Updated last year
- Cache File System optimized for columnar formats and object stores☆187Updated 3 years ago
- Spark SQL index for Parquet tables☆134Updated 4 years ago
- Java event logs collector for hadoop and frameworks☆41Updated 10 months ago
- A software library of stochastic streaming algorithms, a.k.a. sketches.☆105Updated last week
- Pafka originated from the OpenAIOS project to leverage an optimized tiered storage access strategy to improve overall performance for …☆67Updated 4 years ago
- Custom Service for deploying Apache Alluxio on a running HDP 2.3 / IOP 4.1 Ambari Managed Cluster☆13Updated 9 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy☆63Updated 2 years ago
- Livy and Zeppelin services for Cloudera Manager and CDH using CSDs and Parcels☆22Updated 7 years ago
- Base POM for Airlift☆54Updated this week