jcrist / hdfscmLinks

An HDFS backed ContentsManager implementation for Jupyter

☆12

Alternatives and similar repositories for hdfscm

Users that are interested in hdfscm are comparing it to the libraries listed below

Sorting:

maropu / spark-sql-server
Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol
☆34Updated 3 years ago
oap-project / sql-ds-cache
Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.
☆37Updated 2 years ago
eastcirclek / flink-service-discovery
Discover Flink clusters on Hadoop YARN for Prometheus
☆23Updated 5 years ago
twilio / calcite-kudu
Apache Calcite Adapter for Apache Kudu
☆28Updated last month
ververica / jupyter-vvp
Jupyter Integration for Flink SQL via Ververica Platform
☆43Updated 2 years ago
streamnative / pulsar-spark
Spark Connector to read and write with Pulsar
☆116Updated last month
nextbreakpoint / flink-controller
Flink Controller implements a Kubernetes Custom Controller (aka Kubernetes Operator) for Apache Flink
☆53Updated 10 months ago
VaBezruchko / spark-clickhouse-connector
Spark Clickhouse Connector
☆71Updated 5 years ago
tzolov / calcite-sql-rewriter
JDBC driver that converts any INSERT, UPDATE and DELETE statements into append-only INSERTs. Instead of updating rows in-place it inserts…
☆82Updated 8 years ago
apache / phoenix-queryserver
Apache Phoenix Query Server
☆51Updated 3 weeks ago
streamnative / pulsar-io-kafka
Pulsar IO Kafka Connector
☆24Updated 2 years ago
criteo / garmadon
Java event logs collector for hadoop and frameworks
☆41Updated 7 months ago
prestodb / benchto
Framework for running macro benchmarks in a clustered environment
☆25Updated 3 years ago
debezium / debezium-design-documents
☆34Updated 6 months ago
himanshug / druid-hadoop-utils
Read druid segments from hadoop
☆10Updated 8 years ago
apache / pulsar-adapters
Apache Pulsar Adapters
☆24Updated 10 months ago
aljoscha / flink-fault-tolerant-stream-example
An example of using Flink for Fault-Tolerant Stream Processing
☆12Updated 6 years ago
jeoffreylim / maelstrom
Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream …
☆22Updated 8 years ago
lightbend / flink-operator
Helm Chart for lyft/flinkk8soperator
☆11Updated 5 years ago
marcelmay / hfsa
Hadoop FSImage Analyzer (HFSA)
☆62Updated this week
datafusion-contrib / datafusion-java
Java binding to Apache DataFusion
☆83Updated 6 months ago
diennea / bookkeeper-visual-manager
A visual interface for Apache BookKeeper
☆61Updated last month
king / bravo
Utilities for processing Flink checkpoints/savepoints
☆76Updated 5 years ago
metamx / druid-spark-batch
Druid indexing plugin for using Spark in batch jobs
☆101Updated 4 years ago
CoxAutomotiveDataSolutions / spark-distcp
A re-implementation of Hadoop DistCP in Apache Spark
☆47Updated last year
qubole / rubix
Cache File System optimized for columnar formats and object stores
☆184Updated 3 years ago
qubole / presto-udfs
Plugin for Presto to allow addition of user functions easily
☆120Updated 4 years ago
xskipper-io / xskipper
An Extensible Data Skipping Framework
☆47Updated 3 months ago
NetEase / spark-alarm
Alerting and monitoring tool for Apache Spark
☆23Updated 3 years ago
ververica / frocksdb
☆65Updated last year