Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20
☆549Apr 24, 2024Updated 2 years ago
Alternatives and similar repositories for hadoop-lzo
Users that are interested in hadoop-lzo are comparing it to the libraries listed below.
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.☆1,134Apr 10, 2023Updated 3 years ago
- Snappy compression for Hadoop☆41Jun 18, 2015Updated 10 years ago
- Python MapReduce library written in Cython. Visit us in #hadoopy on freenode. See the link below for documentation and tutorials.☆242Jan 8, 2016Updated 10 years ago
- Elephant Twin is a framework for creating indexes in Hadoop☆98Oct 12, 2020Updated 5 years ago
- This source can record the file position if the Flume application has been killed; it also knows which line should be read from next tim…☆19Jan 9, 2017Updated 9 years ago
- Simple bash functions for manipulating Amazon Elastic MapReduce clusters☆45Jan 5, 2016Updated 10 years ago
- Example code for "Web-Scale Computer Vision using MapReduce for Multimedia Data Mining"☆48Aug 2, 2010Updated 15 years ago
- Splittable Gzip codec for Hadoop☆77Apr 14, 2026Updated 2 weeks ago
- 4mc - splittable lz4 and zstd in hadoop/spark/flink☆109Apr 21, 2023Updated 3 years ago
- Mirror of Apache Oozie☆728Jan 27, 2025Updated last year
- Oozie - workflow engine for Hadoop☆375Jun 8, 2017Updated 8 years ago
- Scribe is a server for aggregating log data streamed in real time from a large number of servers. It is designed to be scalable, extensib…☆112May 17, 2011Updated 14 years ago
- hRaven collects run time data and statistics from MapReduce jobs in an easily queryable format☆129Jan 14, 2022Updated 4 years ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,264Apr 22, 2026Updated last week
- Python module that allows one to easily write and run Hadoop programs.☆1,031Jan 9, 2018Updated 8 years ago
- A C interface to watchman☆47Jun 7, 2019Updated 6 years ago
- Metrics produced to Kafka and consumers for monitoring them☆104Jan 10, 2015Updated 11 years ago
- mapreduce in bash☆920Oct 26, 2019Updated 6 years ago
- http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en//pubs/archive/36266.pdf☆14Apr 25, 2012Updated 14 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Oct 14, 2015Updated 10 years ago
- Apache Phoenix☆1,055Updated this week
- Easy Map/Reduce with Hadoop and Ruby. Also see http://github.com/forward/mandy-lab for examples.☆45Jan 25, 2012Updated 14 years ago
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark☆1,370Aug 22, 2023Updated 2 years ago
- Solr on YARN prototype☆18Nov 14, 2014Updated 11 years ago
- WE HAVE MOVED to Apache Incubator. https://cwiki.apache.org/FLUME/ . Flume is a distributed, reliable, and available service for effici…☆943May 26, 2021Updated 4 years ago
- Open source SQL Query Assistant service for Databases/Warehouses☆1,409Updated this week
- Hadoop log aggregator and dashboard☆190Oct 29, 2013Updated 12 years ago
- REST job server for Apache Spark☆2,845Mar 3, 2026Updated last month
- XML-RPC version of the Stanford POS tagger☆21Aug 25, 2010Updated 15 years ago
- Apache Hive☆5,973Updated this week
- ☆18Aug 25, 2017Updated 8 years ago
- MapReduce examples on HBase☆48Apr 10, 2010Updated 16 years ago
- ## Auto-archived due to inactivity. ## Simple JVM Profiler Using StatsD and Other Metrics Backends☆15Oct 3, 2023Updated 2 years ago
- Mirror of Apache Hadoop MapReduce☆21Feb 2, 2011Updated 15 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,009Oct 5, 2022Updated 3 years ago
- Mirror of Apache Whirr☆95Apr 28, 2017Updated 9 years ago
- ☆10Aug 28, 2014Updated 11 years ago
- Prototype mesos framework using new low-level API built in Go☆61Nov 17, 2014Updated 11 years ago
- Apache ORC - the smallest, fastest columnar storage for Hadoop workloads☆766Updated this week