julienledem / redelm
an anagram
☆134Updated 3 years ago
Alternatives and similar repositories for redelm:
Users that are interested in redelm are comparing it to the libraries listed below
- Cache File System optimized for columnar formats and object stores☆182Updated 2 years ago
- ☆104Updated last year
- Apache datasketches☆93Updated 2 years ago
- ☆79Updated 3 weeks ago
- Website for DataSketches.☆96Updated last week
- Spark SQL index for Parquet tables☆134Updated 3 years ago
- Spark Connector to read and write with Pulsar☆113Updated 2 months ago
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.☆256Updated last year
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.☆424Updated 3 years ago
- Flowchart for debugging Spark applications☆104Updated 4 months ago
- A tool to get better debug info on spark's memory usage☆42Updated 5 years ago
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆86Updated 9 months ago
- The SpliceSQL Engine☆167Updated last year
- Apache ORC - the smallest, fastest columnar storage for Hadoop workloads☆709Updated this week
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆62Updated last month
- Custom state store providers for Apache Spark☆92Updated 2 years ago
- Splittable Gzip codec for Hadoop☆69Updated this week
- Collection of utilities to allow writing java code that operates across a wide range of avro versions.☆77Updated this week
- An extension of Yahoo's Benchmarks☆107Updated last year
- Code snippets from the Streaming Systems book (streamingbook.net).☆245Updated 2 years ago
- Iceberg is a table format for large, slow-moving tabular data☆480Updated last year
- Dione - a Spark and HDFS indexing library☆51Updated 10 months ago
- Mirror of Apache Omid Incubator☆88Updated last month
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆299Updated last year
- The Internals of Delta Lake☆183Updated 2 weeks ago
- ACID Data Source for Apache Spark based on Hive ACID☆97Updated 3 years ago
- Spark* shuffle plugin for support shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local-dis…☆21Updated 10 months ago
- A collection of libraries for single-pass, distributed, sublinear-space approximate aggregation and sketching algorithms. Currently: Hype…☆155Updated 2 years ago
- Schema Registry☆15Updated 7 months ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆58Updated last year