Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20
☆548 · Apr 24, 2024 · Updated 2 years ago
Alternatives and similar repositories for hadoop-lzo
Users that are interested in hadoop-lzo are comparing it to the libraries listed below.
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code. ☆1,134 · Apr 10, 2023 · Updated 3 years ago
- Patched, refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20 ☆100 · Jan 10, 2012 · Updated 14 years ago
- Snappy compression for Hadoop ☆41 · Jun 18, 2015 · Updated 10 years ago
- Python MapReduce library written in Cython. Visit us in #hadoopy on freenode. See the link below for documentation and tutorials. ☆243 · Jan 8, 2016 · Updated 10 years ago
- The Colossal Pipe framework for map/reduce processing. ☆29 · Aug 19, 2014 · Updated 11 years ago
- Patched, refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20 ☆37 · Aug 13, 2012 · Updated 13 years ago
- This source can record the file position if the Flume application has been killed; it also knows which line should be read from next tim… ☆19 · Jan 9, 2017 · Updated 9 years ago
- Example code for "Web-Scale Computer Vision using MapReduce for Multimedia Data Mining" ☆48 · Aug 2, 2010 · Updated 15 years ago
- 4mc - splittable lz4 and zstd in hadoop/spark/flink ☆109 · Apr 21, 2023 · Updated 3 years ago
- Mirror of Apache Oozie ☆728 · Jan 27, 2025 · Updated last year
- Parallel Algorithms in Python for Hadoop/Mapreduce ☆55 · Aug 10, 2012 · Updated 13 years ago
- Scribe is a server for aggregating log data streamed in real time from a large number of servers. It is designed to be scalable, extensib… ☆112 · May 17, 2011 · Updated 15 years ago
- Oozie - workflow engine for Hadoop ☆375 · Jun 8, 2017 · Updated 8 years ago
- hRaven collects run time data and statistics from MapReduce jobs in an easily queryable format ☆129 · Jan 14, 2022 · Updated 4 years ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga… ☆2,266 · Updated this week
- Metrics produced to Kafka and consumers for monitoring them ☆104 · Jan 10, 2015 · Updated 11 years ago
- Using Hadoop with Scala ☆70 · Oct 5, 2013 · Updated 12 years ago
- http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en//pubs/archive/36266.pdf ☆14 · Apr 25, 2012 · Updated 14 years ago
- Interactive Audience Analytics with Spark and HyperLogLog ☆55 · Oct 14, 2015 · Updated 10 years ago
- Apache Phoenix ☆1,057 · Updated this week
- A simple benchmark of noSQL databases for both read/update and MapReduce performances ☆32 · May 14, 2011 · Updated 15 years ago
- Easy Map/Reduce with Hadoop and Ruby. Also see http://github.com/forward/mandy-lab for examples. ☆45 · Jan 25, 2012 · Updated 14 years ago
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark ☆1,369 · Aug 22, 2023 · Updated 2 years ago
- Solr on YARN prototype ☆18 · Nov 14, 2014 · Updated 11 years ago
- WE HAVE MOVED to Apache Incubator. https://cwiki.apache.org/FLUME/ . Flume is a distributed, reliable, and available service for effici… ☆943 · May 26, 2021 · Updated 4 years ago
- Visualize your HDFS cluster usage ☆228 · Oct 13, 2020 · Updated 5 years ago
- Open source SQL Query Assistant service for Databases/Warehouses ☆1,402 · Updated this week
- Gather statistics about the jvm garbage collection and push into ganglia ☆32 · Feb 18, 2010 · Updated 16 years ago
- Hadoop log aggregator and dashboard ☆190 · Oct 29, 2013 · Updated 12 years ago
- REST job server for Apache Spark ☆2,843 · Mar 3, 2026 · Updated 2 months ago
- Apache Hive ☆5,966 · Updated this week
- ☆18 · Aug 25, 2017 · Updated 8 years ago
- map reduce examples on HBase ☆48 · Apr 10, 2010 · Updated 16 years ago
- ## Auto-archived due to inactivity. ## Simple JVM Profiler Using StatsD and Other Metrics Backends ☆15 · Oct 3, 2023 · Updated 2 years ago
- Mirror of Apache Hadoop MapReduce ☆21 · Feb 2, 2011 · Updated 15 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere ☆1,008 · Oct 5, 2022 · Updated 3 years ago
- Mirror of Apache Whirr ☆96 · Apr 28, 2017 · Updated 9 years ago
- ☆10 · Aug 28, 2014 · Updated 11 years ago
- Storehaus is a library that makes it easy to work with asynchronous key value stores ☆465 · Jul 17, 2020 · Updated 5 years ago