Facebook's Realtime Distributed FS based on Apache Hadoop 0.20-append
☆880Oct 10, 2014Updated 11 years ago
Alternatives and similar repositories for hadoop-20
Users that are interested in hadoop-20 are comparing it to the libraries listed below
Sorting:
- USC Version of Hadoop that includes HDFS-RAID. Erasure codes like Locally Repairable Codes (aka Simple Regenerating Code), Reed Solomon C…☆71Jul 18, 2013Updated 12 years ago
- A distributed storage system for managing structured data while providing reliability at scale.☆217May 18, 2017Updated 8 years ago
- Mirror of Apache Hadoop MapReduce☆21Feb 2, 2011Updated 15 years ago
- Mirror of Apache Hadoop HDFS☆18Feb 2, 2011Updated 15 years ago
- Rain is a statistics-based workload generation toolkit that uses parameterized and empirical distributions to model the different classes…☆34Nov 2, 2016Updated 9 years ago
- ☆34Aug 1, 2025Updated 7 months ago
- Toolkit of simple scripts useful for managing Hadoop☆17Mar 31, 2011Updated 14 years ago
- Scribe is a server for aggregating log data streamed in real time from a large number of servers.☆3,914Aug 27, 2020Updated 5 years ago
- Oozie - workflow engine for Hadoop☆374Jun 8, 2017Updated 8 years ago
- Common metadata layer for Hadoop's Map Reduce, Pig, and Hive☆76Feb 17, 2011Updated 15 years ago
- ☆10Feb 20, 2021Updated 5 years ago
- Smart Storage Management for Big Data, a comprehensive hot/cold data optimized solution☆140Jan 3, 2023Updated 3 years ago
- RocksDB made replicated using Robust Distributed System Nucleus (rDSN) (Delta Learning)☆16Sep 15, 2015Updated 10 years ago
- Salt Formula to set up and configure Cassandra cluster☆12Aug 11, 2015Updated 10 years ago
- Common dependencies for Spinnaker☆14Jul 9, 2019Updated 6 years ago
- WE HAVE MOVED to Apache Incubator. https://cwiki.apache.org/FLUME/ . Flume is a distributed, reliable, and available service for effici…☆944May 26, 2021Updated 4 years ago
- A fully asynchronous, non-blocking, thread-safe, high-performance HBase client.☆609May 19, 2023Updated 2 years ago
- realtime search/indexing system☆42Dec 16, 2013Updated 12 years ago
- Mavuno: A Hadoop-Based Text Mining Toolkit☆47Feb 7, 2012Updated 14 years ago
- PHP Performance Metrics☆36Oct 1, 2013Updated 12 years ago
- Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20☆551Apr 24, 2024Updated last year
- Reads a HBase table and writes the out as Text, Seq, Avro, or Parquet☆28May 15, 2014Updated 11 years ago
- Puts ganglia gmond information on a zeromq pub/sub☆34Dec 23, 2011Updated 14 years ago
- realtime search/indexing system☆59May 27, 2014Updated 11 years ago
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.☆1,133Apr 10, 2023Updated 2 years ago
- 第五次Druid meetup ppt集锦☆20Aug 7, 2017Updated 8 years ago
- ☆18Apr 7, 2025Updated 11 months ago
- ☆27Mar 30, 2021Updated 4 years ago
- Using logic programming (Clojure's core.logic) for test data manipulation and generation☆59Nov 23, 2012Updated 13 years ago
- MongoDB Connector for Hadoop☆1,620Jan 28, 2022Updated 4 years ago
- Quick RPC latency benchmark of Cap'n Proto RPC vs. Apache Thrift vs. ZeroC Ice☆19Dec 14, 2013Updated 12 years ago
- a project most codes extracting from spark-yarn module make build yarn program more easy☆13Apr 9, 2016Updated 9 years ago
- ☆16Mar 16, 2021Updated 5 years ago
- Manage node.js with SaltStack☆26Apr 7, 2025Updated 11 months ago
- Python driver for MongoDB branch py3k for python3.1 and higher. (see http://wiki.github.com/sovnarkom/mongo-python3-driver)☆13Jan 10, 2010Updated 16 years ago
- ☆20May 30, 2012Updated 13 years ago
- A search index specialised for LaTeX equations. Developed for latexsearch.com.☆17Jul 15, 2011Updated 14 years ago
- A platform for visualization and real-time monitoring of data workflows☆1,169Jan 22, 2020Updated 6 years ago
- Toolkit of simple scripts useful for managing Hadoop☆16May 3, 2012Updated 13 years ago