Facebook's Realtime Distributed FS based on Apache Hadoop 0.20-append
☆876Oct 10, 2014Updated 11 years ago
Alternatives and similar repositories for hadoop-20
Users that are interested in hadoop-20 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- USC Version of Hadoop that includes HDFS-RAID. Erasure codes like Locally Repairable Codes (aka Simple Regenerating Code), Reed Solomon C…☆71Jul 18, 2013Updated 12 years ago
- A distributed storage system for managing structured data while providing reliability at scale.☆218May 18, 2017Updated 8 years ago
- Mirror of Apache Hadoop HDFS☆17Feb 2, 2011Updated 15 years ago
- Rain is a statistics-based workload generation toolkit that uses parameterized and empirical distributions to model the different classes…☆35Nov 2, 2016Updated 9 years ago
- Real Time Proxy☆79Oct 10, 2014Updated 11 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- SHARDS implementation in C.☆19Aug 4, 2018Updated 7 years ago
- Toolkit of simple scripts useful for managing Hadoop☆17Mar 31, 2011Updated 15 years ago
- Scribe is a server for aggregating log data streamed in real time from a large number of servers.☆3,910Aug 27, 2020Updated 5 years ago
- libpcap bindings for node☆27May 14, 2014Updated 11 years ago
- Oozie - workflow engine for Hadoop☆375Jun 8, 2017Updated 8 years ago
- Statistical Workload Injector for MapReduce - Project at UC Berkeley AMP Lab☆128May 29, 2014Updated 11 years ago
- Common metadata layer for Hadoop's Map Reduce, Pig, and Hive☆76Feb 17, 2011Updated 15 years ago
- Hiding Things Out In The Open☆79Jun 7, 2014Updated 11 years ago
- RocksDB made replicated using Robust Distributed System Nucleus (rDSN) (Delta Learning)☆16Sep 15, 2015Updated 10 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Salt Formula to set up and configure Cassandra cluster☆12Aug 11, 2015Updated 10 years ago
- Common dependencies for Spinnaker☆14Jul 9, 2019Updated 6 years ago
- WE HAVE MOVED to Apache Incubator. https://cwiki.apache.org/FLUME/ . Flume is a distributed, reliable, and available service for effici…☆943May 26, 2021Updated 4 years ago
- Better code coverage tool for JavaScript.☆89May 21, 2020Updated 5 years ago
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…☆257Apr 7, 2023Updated 3 years ago
- A fully asynchronous, non-blocking, thread-safe, high-performance HBase client.☆610May 19, 2023Updated 2 years ago
- realtime search/indexing system☆41Dec 16, 2013Updated 12 years ago
- PHP Performance Metrics☆36Oct 1, 2013Updated 12 years ago
- Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20☆549Apr 24, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Reads a HBase table and writes the out as Text, Seq, Avro, or Parquet☆28May 15, 2014Updated 11 years ago
- realtime search/indexing system☆59May 27, 2014Updated 11 years ago
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.☆1,134Apr 10, 2023Updated 3 years ago
- Secondary index on HBase☆18Oct 24, 2015Updated 10 years ago
- 第五次Druid meetup ppt集锦☆20Aug 7, 2017Updated 8 years ago
- ☆18Apr 7, 2025Updated last year
- R code needed to reproduce Relationship between Reddit Comment Score and Comment Length for 1.66 Billion Comments visualization☆17Jul 8, 2015Updated 10 years ago
- ☆27Mar 30, 2021Updated 5 years ago
- Network visualization with Gephi: tutorial and example data files☆16Jan 4, 2017Updated 9 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- MongoDB Connector for Hadoop☆1,579Jan 28, 2022Updated 4 years ago
- NEW: see http://www.hops.io/. OLD: This work aims to re-engineer the Hadoop Distributed File System (HDFS) so that it can be 1) highly av…☆26Jan 2, 2012Updated 14 years ago
- Quick RPC latency benchmark of Cap'n Proto RPC vs. Apache Thrift vs. ZeroC Ice☆20Dec 14, 2013Updated 12 years ago
- a project most codes extracting from spark-yarn module make build yarn program more easy☆13Apr 9, 2016Updated 10 years ago
- a tiny wrapper for mySQLDB☆39Jul 6, 2013Updated 12 years ago
- Python driver for MongoDB branch py3k for python3.1 and higher. (see http://wiki.github.com/sovnarkom/mongo-python3-driver)☆13Jan 10, 2010Updated 16 years ago
- One of the first go libraries, targets the old Twitter API which no longer works☆50Oct 4, 2012Updated 13 years ago