hopshadoop / hops-metadata-dal-impl-ndbLinks

☆9

Alternatives and similar repositories for hops-metadata-dal-impl-ndb

Users that are interested in hops-metadata-dal-impl-ndb are comparing it to the libraries listed below

Sorting:

hopshadoop / hops
Hops Hadoop is a distribution of Apache Hadoop with distributed metadata.
☆314Updated 3 weeks ago
logicalclocks / hopsworks-chef
Chef Cookbook for Hopsworks
☆12Updated last month
MemVerge / splash
Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
☆127Updated 5 months ago
oap-project / gazelle_plugin
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
☆257Updated 2 years ago
TU-Berlin-DIMA / scotty-window-processor
This repository provides Scotty, a framework for efficient window aggregations for out-of-order Stream Processing.
☆77Updated last year
linkedin / spark
Apache Spark - A unified analytics engine for large-scale data processing
☆16Updated last year
squito / spark-memory
A tool to get better debug info on spark's memory usage
☆42Updated 5 years ago
cerndb / SparkPlugins
Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…
☆89Updated 3 weeks ago
rheem-ecosystem / rheem
Rheem - a cross-platform data processing system
☆5Updated 3 years ago
yaooqinn / spark-authorizer
A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | 项目已迁移至 Apa…
☆177Updated 3 years ago
liancheng / spear
A playground for experimenting ideas that may apply to Spark SQL/Catalyst
☆141Updated 6 years ago
maropu / spark-tpcds-datagen
All the things about TPC-DS in Apache Spark
☆106Updated last year
velox4j / velox4j
Java bindings for https://github.com/facebookincubator/velox
☆27Updated this week
chermenin / spark-states
Custom state store providers for Apache Spark
☆92Updated 3 months ago
amplab / drizzle-spark
Drizzle integration with Apache Spark
☆120Updated 6 years ago
peelframework / peel
Peel is a framework that helps you to define, execute, analyze, and share experiments for distributed systems and algorithms.
☆27Updated 2 years ago
Mellanox / SparkRDMA
This is archive of SparkRDMA project. The new repository with RDMA shuffle acceleration for Apache Spark is here: https://github.com/Nvid…
☆248Updated 6 years ago
ververica / stateful-functions
Stateful Functions for Apache Flink
☆276Updated last year
IBM / spark-tpc-ds-performance-test
Use the TPC-DS benchmark to test Spark SQL performance
☆179Updated 5 years ago
criteo / babar
Profiler for large-scale distributed java applications (Spark, Scalding, MapReduce, Hive,...) on YARN.
☆127Updated 6 years ago
lightcopy / parquet-index
Spark SQL index for Parquet tables
☆134Updated 4 years ago
rxin / TPC-H-Hive
Running TPC-H on Apache Hive
☆41Updated 5 years ago
dc-sics / hopsworks
HopsWorks - Hadoop for Humans
☆117Updated 6 years ago
starburstdata / facebook-presto
Starburst Enterprise Distribution of Presto
☆45Updated 3 years ago
linkedin / dynamometer
A tool for scale and performance testing of HDFS with a specific focus on the NameNode.
☆131Updated last year
microsoft / Dhalion
Self regulation and auto-tuning for distributed system
☆65Updated last year
FlinkML / flink-jpmml
flink-jpmml is a fresh-made library for dynamic real time machine learning predictions built on top of PMML standard models and Apache Fl…
☆96Updated 6 years ago
alibaba / SparkCube
SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.
☆133Updated 2 years ago
dataArtisans / flink-benchmarks
☆56Updated 4 years ago
IBM / spark-s3-shuffle
A S3 Shuffle plugin for Apache Spark to enable elastic scaling for generic Spark workloads.
☆44Updated last month