Facebook's Realtime Distributed FS based on Apache Hadoop 0.20-append
☆875Oct 10, 2014Updated 11 years ago
Alternatives and similar repositories for hadoop-20
Users that are interested in hadoop-20 are comparing it to the libraries listed below.
- USC Version of Hadoop that includes HDFS-RAID. Erasure codes like Locally Repairable Codes (aka Simple Regenerating Code), Reed Solomon C…☆71Jul 18, 2013Updated 12 years ago
- Mirror of Apache Hadoop MapReduce☆21Feb 2, 2011Updated 15 years ago
- Mirror of Apache Hadoop HDFS☆17Feb 2, 2011Updated 15 years ago
- Scribe is a server for aggregating log data streamed in real time from a large number of servers.☆3,911Aug 27, 2020Updated 5 years ago
- Oozie - workflow engine for Hadoop☆374Jun 8, 2017Updated 9 years ago
- Common metadata layer for Hadoop's Map Reduce, Pig, and Hive☆77Feb 17, 2011Updated 15 years ago
- Smart Storage Management for Big Data, a comprehensive hot/cold data optimized solution☆139Jan 3, 2023Updated 3 years ago
- RocksDB made replicated using Robust Distributed System Nucleus (rDSN) (Delta Learning)☆16Sep 15, 2015Updated 10 years ago
- Salt Formula to set up and configure Cassandra cluster☆12Aug 11, 2015Updated 10 years ago
- Common dependencies for Spinnaker☆13Jul 9, 2019Updated 6 years ago
- Better code coverage tool for JavaScript.☆89May 21, 2020Updated 6 years ago
- WE HAVE MOVED to Apache Incubator. https://cwiki.apache.org/FLUME/ . Flume is a distributed, reliable, and available service for effici…☆943May 26, 2021Updated 5 years ago
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…☆256Apr 7, 2023Updated 3 years ago
- A fully asynchronous, non-blocking, thread-safe, high-performance HBase client.☆610May 19, 2023Updated 3 years ago
- realtime search/indexing system☆41Dec 16, 2013Updated 12 years ago
- Mavuno: A Hadoop-Based Text Mining Toolkit☆48Feb 7, 2012Updated 14 years ago
- Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20☆548Apr 24, 2024Updated 2 years ago
- Reads an HBase table and writes it out as Text, Seq, Avro, or Parquet☆28May 15, 2014Updated 12 years ago
- realtime search/indexing system☆59May 27, 2014Updated 12 years ago
- ☆58Mar 27, 2019Updated 7 years ago
- Nitro Web Application Framework☆74Aug 6, 2010Updated 15 years ago
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.☆1,134Apr 10, 2023Updated 3 years ago
- Secondary index on HBase☆18Oct 24, 2015Updated 10 years ago
- a toy duckdb based timeseries database☆15Sep 30, 2020Updated 5 years ago
- Slide collection from the fifth Druid meetup☆20Aug 7, 2017Updated 8 years ago
- new home for s3sync command line tools☆167Jul 29, 2015Updated 10 years ago
- R code needed to reproduce Relationship between Reddit Comment Score and Comment Length for 1.66 Billion Comments visualization☆17Jul 8, 2015Updated 10 years ago
- A generic monitoring client☆14May 17, 2012Updated 14 years ago
- Network visualization with Gephi: tutorial and example data files☆16Jan 4, 2017Updated 9 years ago
- Caravel is a data exploration platform designed to be visual, intuitive, and interactive☆20Aug 30, 2016Updated 9 years ago
- MongoDB Connector for Hadoop☆1,559Jan 28, 2022Updated 4 years ago
- Quick RPC latency benchmark of Cap'n Proto RPC vs. Apache Thrift vs. ZeroC Ice☆20Dec 14, 2013Updated 12 years ago
- NEW: see http://www.hops.io/. OLD: This work aims to re-engineer the Hadoop Distributed File System (HDFS) so that it can be 1) highly av…☆26Jan 2, 2012Updated 14 years ago
- A project with most of its code extracted from the spark-yarn module, to make building YARN programs easier☆13Apr 9, 2016Updated 10 years ago
- Python driver for MongoDB branch py3k for python3.1 and higher. (see http://wiki.github.com/sovnarkom/mongo-python3-driver)☆13Jan 10, 2010Updated 16 years ago
- ☆20May 30, 2012Updated 14 years ago
- A platform for visualization and real-time monitoring of data workflows☆1,170Jan 22, 2020Updated 6 years ago
- A search index specialised for LaTeX equations. Developed for latexsearch.com.☆17Jul 15, 2011Updated 14 years ago
- Honu is a large scale data collection and processing pipeline☆84Feb 4, 2011Updated 15 years ago