4mc - splittable lz4 and zstd in hadoop/spark/flink
☆109 · Apr 21, 2023 · Updated 2 years ago
Alternatives and similar repositories for 4mc
Users who are interested in 4mc are comparing it to the libraries listed below
- Java port of TLSH (Trend Micro Locality Sensitive Hash) ☆24 · Apr 26, 2021 · Updated 4 years ago
- A linter for Thrift IDL files ☆16 · Updated this week
- Presto SQL query formatter ☆15 · Jan 1, 2024 · Updated 2 years ago
- Decoding Raymarine's ARCHIVE.FSH files, Garmin's IMG/ADM archives and the TRK subfiles. ☆10 · Oct 22, 2019 · Updated 6 years ago
- Albis: High-Performance File Format for Big Data Systems ☆21 · Jul 12, 2018 · Updated 7 years ago
- Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20 ☆551 · Apr 24, 2024 · Updated last year
- C library for efficient string matching with Aho-Corasick ☆21 · Jan 20, 2012 · Updated 14 years ago
- Hive User-Defined Functions (UDFs) for Text Mining ☆14 · Feb 24, 2014 · Updated 12 years ago
- Set of hadoop input/output formats for use in combination with hadoop streaming ☆32 · Jul 28, 2017 · Updated 8 years ago
- A Java implementation of SpamSum / SSDeep ☆14 · Jan 9, 2017 · Updated 9 years ago
- A simple RELP library for Go ☆11 · Apr 7, 2020 · Updated 5 years ago
- Utilities to use Avro files from Hadoop Map/Reduce jobs and Streaming ☆26 · Sep 10, 2013 · Updated 12 years ago
- Demonstration of a Hive Input Format for Iceberg ☆26 · Mar 12, 2021 · Updated 5 years ago
- ☆15 · Apr 2, 2025 · Updated 11 months ago
- OpenJDK 8 compact profiles builder ☆12 · Sep 9, 2019 · Updated 6 years ago
- ☆12 · Aug 17, 2015 · Updated 10 years ago
- Deep Learning (Keras) Models Deployment using SQL databases ☆17 · Jul 5, 2019 · Updated 6 years ago
- PostgreSQL and GreenPlum Data Source for Apache Spark ☆35 · Jul 9, 2025 · Updated 8 months ago
- Apache Kafka® sink for transferring events/messages from Kafka topics to Apache Cassandra®, DataStax Astra and DataStax Enterprise (DSE). ☆19 · Jul 25, 2025 · Updated 7 months ago
- Tiny Transactions on Computer Systems (TinyToCS) Site ☆32 · Mar 8, 2016 · Updated 10 years ago
- Collection of HDP Tuning Tricks & Tips (unofficial guide) ☆17 · Sep 26, 2017 · Updated 8 years ago
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments. ☆24 · Jul 7, 2016 · Updated 9 years ago
- The AWS KMS JCE Provider software library for Java is a vendor implementation for the Sun Java JCE (Java Cryptography Extension) provider… ☆19 · Oct 8, 2025 · Updated 5 months ago
- Splittable Gzip codec for Hadoop ☆76 · Feb 25, 2026 · Updated 3 weeks ago
- Haskell Sendgrid v3 API Library ☆15 · May 2, 2024 · Updated last year
- HBase as a JSON Document Database ☆25 · Jun 14, 2023 · Updated 2 years ago
- A configuration manager for WireGuard ☆17 · Feb 5, 2020 · Updated 6 years ago
- functionstest ☆33 · Oct 25, 2016 · Updated 9 years ago
- Export Hadoop YARN (resource-manager) metrics in prometheus format ☆57 · Apr 15, 2025 · Updated 11 months ago
- Using Apache Spark in an ArcMap Toolbox ☆27 · Jan 16, 2014 · Updated 12 years ago
- A docker file for running FoundationDB server and client ☆10 · May 13, 2018 · Updated 7 years ago
- openwrt management tool ☆12 · Dec 6, 2018 · Updated 7 years ago
- a simple image viewer for Java ☆11 · May 10, 2025 · Updated 10 months ago
- A simple script for plotting the flight path of aircraft from ADS-B packet data ☆12 · Jan 25, 2018 · Updated 8 years ago
- Prometheus exporter which fetches JSON from a URL and exports one of the values as gauge metrics ☆24 · Mar 16, 2019 · Updated 7 years ago
- an experimental Scala extension of Jar Jar Links ☆38 · Feb 23, 2026 · Updated 3 weeks ago
- Scrape real-time Dark Web data across Tor to your local kafka network ☆14 · Mar 10, 2016 · Updated 10 years ago
- Prometheus jmx_exporter configurations for Cloudera Hadoop ☆37 · Mar 11, 2018 · Updated 8 years ago
- Traverse HDFS without jvm startup delays and directory context!! Supports multiple HDFS hosts, command line history and tab completion. ☆17 · May 20, 2016 · Updated 9 years ago