Facebook's Realtime Distributed FS based on Apache Hadoop 0.20-append
☆875Oct 10, 2014Updated 11 years ago
Alternatives and similar repositories for hadoop-20
Users that are interested in hadoop-20 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- USC Version of Hadoop that includes HDFS-RAID. Erasure codes like Locally Repairable Codes (aka Simple Regenerating Code), Reed Solomon C…☆71Jul 18, 2013Updated 12 years ago
- A distributed storage system for managing structured data while providing reliability at scale.☆218May 18, 2017Updated 9 years ago
- Mirror of Apache Hadoop MapReduce☆21Feb 2, 2011Updated 15 years ago
- Mirror of Apache Hadoop HDFS☆17Feb 2, 2011Updated 15 years ago
- Rain is a statistics-based workload generation toolkit that uses parameterized and empirical distributions to model the different classes…☆35Nov 2, 2016Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Toolkit of simple scripts useful for managing Hadoop☆17Mar 31, 2011Updated 15 years ago
- Scribe is a server for aggregating log data streamed in real time from a large number of servers.☆3,911Aug 27, 2020Updated 5 years ago
- Oozie - workflow engine for Hadoop☆374Jun 8, 2017Updated 9 years ago
- Datacash payment gateway integration for django-oscar☆18Jul 21, 2018Updated 7 years ago
- Common metadata layer for Hadoop's Map Reduce, Pig, and Hive☆77Feb 17, 2011Updated 15 years ago
- Smart Storage Management for Big Data, a comprehensive hot/cold data optimized solution☆139Jan 3, 2023Updated 3 years ago
- RocksDB made replicated using Robust Distributed System Nucleus (rDSN) (Delta Learning)☆16Sep 15, 2015Updated 10 years ago
- Better code coverage tool for JavaScript.☆89May 21, 2020Updated 6 years ago
- WE HAVE MOVED to Apache Incubator. https://cwiki.apache.org/FLUME/ . Flume is a distributed, reliable, and available service for effici…☆943May 26, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…☆257Apr 7, 2023Updated 3 years ago
- A fully asynchronous, non-blocking, thread-safe, high-performance HBase client.☆610May 19, 2023Updated 3 years ago
- PHP Performance Metrics☆36Oct 1, 2013Updated 12 years ago
- Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20☆548Apr 24, 2024Updated 2 years ago
- Reads a HBase table and writes the out as Text, Seq, Avro, or Parquet☆28May 15, 2014Updated 12 years ago
- Puts ganglia gmond information on a zeromq pub/sub☆34Dec 23, 2011Updated 14 years ago
- realtime search/indexing system☆59May 27, 2014Updated 12 years ago
- ☆58Mar 27, 2019Updated 7 years ago
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.☆1,134Apr 10, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Secondary index on HBase☆18Oct 24, 2015Updated 10 years ago
- sgx-based encrypted deduplication prototype☆13May 14, 2021Updated 5 years ago
- new home for s3sync command line tools☆167Jul 29, 2015Updated 10 years ago
- R code needed to reproduce Relationship between Reddit Comment Score and Comment Length for 1.66 Billion Comments visualization☆17Jul 8, 2015Updated 10 years ago
- A generic monitoring client☆14May 17, 2012Updated 14 years ago
- ☆27Mar 30, 2021Updated 5 years ago
- Network visualization with Gephi: tutorial and example data files☆16Jan 4, 2017Updated 9 years ago
- Using logic programming (Clojure's core.logic) for test data manipulation and generation☆59Nov 23, 2012Updated 13 years ago
- Caravel is a data exploration platform designed to be visual, intuitive, and interactive☆20Aug 30, 2016Updated 9 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Quick RPC latency benchmark of Cap'n Proto RPC vs. Apache Thrift vs. ZeroC Ice☆20Dec 14, 2013Updated 12 years ago
- ☆16Mar 16, 2021Updated 5 years ago
- a project most codes extracting from spark-yarn module make build yarn program more easy☆13Apr 9, 2016Updated 10 years ago
- One of the first go libraries, targets the old Twitter API which no longer works☆50Oct 4, 2012Updated 13 years ago
- ☆20May 30, 2012Updated 14 years ago
- A platform for visualization and real-time monitoring of data workflows☆1,170Jan 22, 2020Updated 6 years ago
- Honu is a large scale data collection and processing pipeline☆84Feb 4, 2011Updated 15 years ago