4mc - splittable lz4 and zstd in hadoop/spark/flink
☆109Apr 21, 2023Updated 3 years ago
Alternatives and similar repositories for 4mc
Users that are interested in 4mc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Java port of TLSH (Trend Micro Locality Sensitive Hash)☆25Apr 26, 2021Updated 5 years ago
- Presto SQL query formatter☆15Jan 1, 2024Updated 2 years ago
- Decoding Raymarine's ARCHIVE.FSH files, Garmin's IMG/ADM archives and the TRK subfiles.☆10Oct 22, 2019Updated 6 years ago
- Mikrotik EoIP implementation. Rewrite of the implementation at linux-eoip.googlecode.com for better latency and buffering performance.☆14Mar 27, 2014Updated 12 years ago
- Albis: High-Performance File Format for Big Data Systems☆21Jul 12, 2018Updated 7 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20☆549Apr 24, 2024Updated 2 years ago
- C library for efficient string matching with Aho-Corasick☆21Jan 20, 2012Updated 14 years ago
- Hive User-Defined Functions (UDFs) for Text Mining☆14Feb 24, 2014Updated 12 years ago
- Set of hadoop input/output formats for use in combination with hadoop streaming☆32Jul 28, 2017Updated 8 years ago
- Demonstration of a Hive Input Format for Iceberg☆26Mar 12, 2021Updated 5 years ago
- ☆15Apr 2, 2025Updated last year
- OpenJDK 8 compact profiles builder☆12Sep 9, 2019Updated 6 years ago
- Deep Learning (Keras) Models Deployment using SQL databases☆17Jul 5, 2019Updated 6 years ago
- Yet Another Avro CLI Tool☆11Nov 8, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆18Jan 31, 2022Updated 4 years ago
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Jul 9, 2025Updated 9 months ago
- Elixir stream-wrapper that transparently handles exceptions.☆12Nov 21, 2020Updated 5 years ago
- JavaScript Embedded Web Server☆19Sep 10, 2012Updated 13 years ago
- Apache Kafka® sink for transferring events/messages from Kafka topics to Apache Cassandra®, DataStax Astra and DataStax Enterprise (DSE).☆19Apr 24, 2026Updated last week
- Tiny Transactions on Computer Systems (TinyToCS) Site☆32Mar 8, 2016Updated 10 years ago
- Easy, flexible C unit testing☆11Feb 13, 2016Updated 10 years ago
- ☆21Jan 16, 2015Updated 11 years ago
- Collection of HDP Tuning Tricks & Tips (unofficial guide)☆17Sep 26, 2017Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments.☆24Jul 7, 2016Updated 9 years ago
- Unison syntax highlighting for VS code☆10Jul 13, 2022Updated 3 years ago
- Running MapR on DCOS☆19Nov 23, 2016Updated 9 years ago
- Indexing Module for elasticell (https://github.com/deepfabric/elasticell)☆24Feb 13, 2018Updated 8 years ago
- ☆11Nov 17, 2017Updated 8 years ago
- Clojure wrapper for LDA topic modeling in MALLET☆33Sep 6, 2011Updated 14 years ago
- Collection of NE2000+ software obtained from various sources☆12Jan 2, 2020Updated 6 years ago
- The AWS KMS JCE Provider software library for Java is a vendor implementation for the Sun Java JCE (Java Cryptography Extension) provider…☆21Oct 8, 2025Updated 6 months ago
- This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark w…☆16Oct 3, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A repo of Java examples using Apache Flink with flink-connector-kafka☆10Mar 10, 2026Updated last month
- Splittable Gzip codec for Hadoop☆77Apr 14, 2026Updated 2 weeks ago
- Haskell Sendgrid v3 API Library☆15May 2, 2024Updated last year
- Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream …☆22Feb 6, 2017Updated 9 years ago
- HBase as a JSON Document Database☆26Jun 14, 2023Updated 2 years ago
- Hadoop FSImage Analyzer (HFSA)☆68Apr 25, 2026Updated last week
- A project in which I work my way through a Clojure version of "The Reasoned Schemer"☆32Dec 12, 2012Updated 13 years ago