Splittable Gzip codec for Hadoop
☆75Feb 25, 2026Updated last week
Alternatives and similar repositories for splittablegzip
Users that are interested in splittablegzip are comparing it to the libraries listed below
Sorting:
- Advanced fold methods for Kotlin☆12Updated this week
- default visualizations that come packaged with the lightning viz notebook☆12Apr 18, 2016Updated 9 years ago
- OLD VERSION OF GEOTRELLIS: A sample GIS service built using GeoTrellis and Spray☆15Sep 30, 2016Updated 9 years ago
- A pluggable actor system written in java leveraging modern features from JDK21+☆36Feb 27, 2026Updated last week
- CDAP Cube Dataset Guide☆12Aug 26, 2017Updated 8 years ago
- A core library for reading, transforming, filtering, and writing data records☆15Jan 17, 2026Updated last month
- ☆34Updated this week
- Spark Shuffle Optimization with RDMA+AEP☆30May 23, 2023Updated 2 years ago
- Atomix Jepsen tests☆14Feb 7, 2017Updated 9 years ago
- This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark w…☆16Oct 3, 2025Updated 5 months ago
- ## Auto-archived due to inactivity. ## Simple JVM Profiler Using StatsD and Other Metrics Backends☆15Oct 3, 2023Updated 2 years ago
- Hadoop Cluster with security☆13Nov 21, 2021Updated 4 years ago
- A chef cookbook for deploying spark☆30Apr 14, 2013Updated 12 years ago
- A java simulator/parser/toolkit for the Dutch Smart Meter Requirements (DSMR)☆18Updated this week
- Task Metrics Explorer☆14Apr 2, 2019Updated 6 years ago
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive☆187Oct 15, 2025Updated 4 months ago
- cli tool to help importing a vagrant box as an AMI☆19Nov 25, 2016Updated 9 years ago
- Spark + Jupyer + Hive☆16Sep 22, 2015Updated 10 years ago
- Lucene based indexing in Cassandra☆61May 3, 2016Updated 9 years ago
- Simple animation for PlantUML diagrams☆19Jul 1, 2024Updated last year
- This is a repository for development. See https://github.com/jpsonic/jpsonic☆16Updated this week
- protobuf pyspark conversion☆23Jun 7, 2023Updated 2 years ago
- ☆18Jun 30, 2022Updated 3 years ago
- Run TPCH Benchmark on Apache Kylin☆22Jan 24, 2022Updated 4 years ago
- ☆20Jun 29, 2017Updated 8 years ago
- Paper: A Zero-rename committer for object stores☆20Nov 7, 2025Updated 3 months ago
- Interactive SQL analytics in your browser!☆22Jan 31, 2018Updated 8 years ago
- FeiTwnd的个人网站☆31Updated this week
- Data abstraction, storage, discovery, and serving system☆35Jan 30, 2026Updated last month
- ☆24Feb 25, 2023Updated 3 years ago
- HDFS compatible Distributed Filesystem backed Cassandra☆25Sep 17, 2015Updated 10 years ago
- ustat - an unified system stats collector tool☆22Feb 5, 2018Updated 8 years ago
- Commonly usable Java classes without dependencies.☆26Feb 7, 2026Updated last month
- A flake8 plugin that detects of usage withColumn in a loop or inside reduce☆28Jun 20, 2025Updated 8 months ago
- Scala client for the Lightning data visualization server (WIP)☆47Jun 25, 2019Updated 6 years ago
- Cascading on Apache Flink®☆54Feb 5, 2024Updated 2 years ago
- JDBCX: Extended JDBC driver for dynamic multi-language queries with optional bridge server for federated datasource connectivity.☆30Feb 23, 2026Updated last week
- This is the example code repository for Getting Started with Impala by John Russell (O'Reilly Media)☆22Aug 20, 2017Updated 8 years ago
- A convenience library for Apache Kafka integration in a Dropwizard service.☆24Feb 23, 2026Updated last week