nielsbasjes / splittablegzipView external linksLinks
Splittable Gzip codec for Hadoop
☆74Dec 12, 2025Updated 2 months ago
Alternatives and similar repositories for splittablegzip
Users that are interested in splittablegzip are comparing it to the libraries listed below
Sorting:
- default visualizations that come packaged with the lightning viz notebook☆12Apr 18, 2016Updated 9 years ago
- OLD VERSION OF GEOTRELLIS: A sample GIS service built using GeoTrellis and Spray☆15Sep 30, 2016Updated 9 years ago
- CDAP Cube Dataset Guide☆12Aug 26, 2017Updated 8 years ago
- ☆34Updated this week
- Spark Shuffle Optimization with RDMA+AEP☆30May 23, 2023Updated 2 years ago
- This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark w…☆16Oct 3, 2025Updated 4 months ago
- ## Auto-archived due to inactivity. ## Simple JVM Profiler Using StatsD and Other Metrics Backends☆15Oct 3, 2023Updated 2 years ago
- Protobuf definitions for the Liftbridge gRPC API. https://github.com/liftbridge-io/liftbridge☆15Dec 22, 2025Updated last month
- Port of Twitter's Scala JVM-profiler to Java☆15Sep 28, 2022Updated 3 years ago
- A chef cookbook for deploying spark☆30Apr 14, 2013Updated 12 years ago
- Hadoop Cluster with security☆13Nov 21, 2021Updated 4 years ago
- Spark + Jupyer + Hive☆16Sep 22, 2015Updated 10 years ago
- Simple animation for PlantUML diagrams☆18Jul 1, 2024Updated last year
- ML models often mispredict, and it is hard to tell when and why. We present a data mining based approach to discover whether there is a c…☆17Jun 6, 2022Updated 3 years ago
- Lucene based indexing in Cassandra☆62May 3, 2016Updated 9 years ago
- ☆16Oct 17, 2024Updated last year
- ☆18Jun 30, 2022Updated 3 years ago
- protobuf pyspark conversion☆23Jun 7, 2023Updated 2 years ago
- ✨ Setup Apache Spark in GitHub Action workflows☆23Oct 30, 2024Updated last year
- Spray.io tutorial☆23Nov 5, 2014Updated 11 years ago
- Run TPCH Benchmark on Apache Kylin☆22Jan 24, 2022Updated 4 years ago
- Paper: A Zero-rename committer for object stores☆20Nov 7, 2025Updated 3 months ago
- ☆20Jun 29, 2017Updated 8 years ago
- memo & blog☆17Feb 8, 2015Updated 11 years ago
- Interactive SQL analytics in your browser!☆22Jan 31, 2018Updated 8 years ago
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive☆186Oct 15, 2025Updated 3 months ago
- HDFS compatible Distributed Filesystem backed Cassandra☆25Sep 17, 2015Updated 10 years ago
- ustat - an unified system stats collector tool☆22Feb 5, 2018Updated 8 years ago
- ☆24Feb 25, 2023Updated 2 years ago
- Cascading on Apache Flink®☆54Feb 5, 2024Updated 2 years ago
- This is the example code repository for Getting Started with Impala by John Russell (O'Reilly Media)☆22Aug 20, 2017Updated 8 years ago
- Pure python HDFS client: python3.x version☆25Jan 23, 2026Updated 3 weeks ago
- Data Mart As A Service☆28Apr 26, 2023Updated 2 years ago
- To String Verifier provides an easy and convenient way to test the toString method on your class.☆33Oct 31, 2022Updated 3 years ago
- A non-opinionated Java bootstrapping configuration library☆35Mar 20, 2025Updated 10 months ago
- Code to collect and analyze traceroute data within a network topology☆28Nov 20, 2018Updated 7 years ago
- Apache Spark Data Source for ROOT File Format☆29Jul 18, 2019Updated 6 years ago
- API and libraries for generating travelsheds from OSM & GTFS data☆40Jul 14, 2018Updated 7 years ago
- Standalone Semanticizer☆32Mar 4, 2015Updated 10 years ago