julienledem / redelm
an anagram
☆135Updated 3 years ago
Alternatives and similar repositories for redelm
Users that are interested in redelm are comparing it to the libraries listed below
- ☆105Updated last year
- Code snippets from the Streaming Systems book (streamingbook.net).☆252Updated 3 years ago
- Spark SQL index for Parquet tables☆134Updated 4 years ago
- Website for DataSketches.☆102Updated 2 weeks ago
- Compatibility tests to make sure C and Java implementations can read each other☆69Updated 3 years ago
- Cache File System optimized for columnar formats and object stores☆182Updated 2 years ago
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆61Updated 6 months ago
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.☆425Updated 3 years ago
- Apache DataSketches☆96Updated 2 years ago
- Apache Flink™ training material website☆78Updated 5 years ago
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆299Updated last year
- The Internals of Delta Lake☆184Updated 5 months ago
- The SpliceSQL Engine☆169Updated 2 years ago
- Flowchart for debugging Spark applications☆105Updated 9 months ago
- Remote shuffle service for Apache Spark to store shuffle data on remote servers.☆327Updated last year
- Stateful Functions for Apache Flink☆275Updated last year
- ACID Data Source for Apache Spark based on Hive ACID☆97Updated 3 years ago
- ☆85Updated this week
- Stocator is a high-performing connector to object storage for Apache Spark, achieving performance by leveraging object storage semantics.☆114Updated last year
- This repository provides Scotty, a framework for efficient window aggregations for out-of-order Stream Processing.☆78Updated last year
- Iceberg is a table format for large, slow-moving tabular data☆480Updated 2 years ago
- A library that provides useful extensions to Apache Spark and PySpark.☆226Updated 3 months ago
- Vectorized processing for Apache Arrow☆485Updated 3 years ago
- Schema Registry☆16Updated last year
- A library that brings useful functions from various modern database management systems to Apache Spark☆59Updated last year
- A collection of libraries for single-pass, distributed, sublinear-space approximate aggregation and sketching algorithms. Currently: Hype…☆157Updated last month
- Hadoop output committers for S3☆109Updated 4 years ago
- Collection of utilities to allow writing Java code that operates across a wide range of Avro versions.☆79Updated last month
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.☆257Updated 2 years ago
- Spark* shuffle plugin to support shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local-dis…☆21Updated last year