facebookarchive / hadoop-20View external linksLinks
Facebook's Realtime Distributed FS based on Apache Hadoop 0.20-append
☆877Oct 10, 2014Updated 11 years ago
Alternatives and similar repositories for hadoop-20
Users that are interested in hadoop-20 are comparing it to the libraries listed below
Sorting:
- A distributed storage system for managing structured data while providing reliability at scale.☆217May 18, 2017Updated 8 years ago
- Mirror of Apache Hadoop MapReduce☆21Feb 2, 2011Updated 15 years ago
- PHP Performance Metrics☆36Oct 1, 2013Updated 12 years ago
- Rain is a statistics-based workload generation toolkit that uses parameterized and empirical distributions to model the different classes…☆35Nov 2, 2016Updated 9 years ago
- Common metadata layer for Hadoop's Map Reduce, Pig, and Hive☆76Feb 17, 2011Updated 14 years ago
- A generic monitoring client☆14May 17, 2012Updated 13 years ago
- Oozie - workflow engine for Hadoop☆374Jun 8, 2017Updated 8 years ago
- Reads a HBase table and writes the out as Text, Seq, Avro, or Parquet☆28May 15, 2014Updated 11 years ago
- Datacash payment gateway integration for django-oscar☆18Jul 21, 2018Updated 7 years ago
- Nitro Web Application Framework☆74Aug 6, 2010Updated 15 years ago
- ☆20May 30, 2012Updated 13 years ago
- Real-time Monitoring☆29May 14, 2012Updated 13 years ago
- One of the first go libraries, targets the old Twitter API which no longer works☆50Oct 4, 2012Updated 13 years ago
- TV show scraper/renamer thingy☆12May 10, 2013Updated 12 years ago
- Scribe is a server for aggregating log data streamed in real time from a large number of servers.☆3,917Aug 27, 2020Updated 5 years ago
- A fully asynchronous, non-blocking, thread-safe, high-performance HBase client.☆609May 19, 2023Updated 2 years ago
- Smart Storage Management for Big Data, a comprehensive hot/cold data optimized solution☆140Jan 3, 2023Updated 3 years ago
- new home for s3sync command line tools☆168Jul 29, 2015Updated 10 years ago
- A compiler and runtime for Google's Sawzall language, optimized for Hadoop☆41Apr 26, 2013Updated 12 years ago
- HTTP request spewer / load generator☆52May 4, 2020Updated 5 years ago
- A simple data serializer in C☆200Mar 6, 2014Updated 11 years ago
- Network visualization with Gephi: tutorial and example data files☆15Jan 4, 2017Updated 9 years ago
- Honu is a large scale data collection and processing pipeline☆83Feb 4, 2011Updated 15 years ago
- ☆57Mar 27, 2019Updated 6 years ago
- Mavuno: A Hadoop-Based Text Mining Toolkit☆47Feb 7, 2012Updated 14 years ago
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Jul 11, 2018Updated 7 years ago
- ☆18Nov 23, 2020Updated 5 years ago
- Quick RPC latency benchmark of Cap'n Proto RPC vs. Apache Thrift vs. ZeroC Ice☆19Dec 14, 2013Updated 12 years ago
- S4 is a general-purpose, distributed, scalable, partially fault-tolerant, pluggable platform that allows programmers to easily develop ap…☆233Mar 4, 2011Updated 14 years ago
- Hiding Things Out In The Open☆78Jun 7, 2014Updated 11 years ago
- A platform for visualization and real-time monitoring of data workflows☆1,171Jan 22, 2020Updated 6 years ago
- Mirror of Apache Slider☆77Dec 11, 2018Updated 7 years ago
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.☆1,132Apr 10, 2023Updated 2 years ago
- A mailbox based distributed computing library☆83May 9, 2014Updated 11 years ago
- IP stack written in Dylan - includes binary parsing and interactive GUI☆25Jan 30, 2014Updated 12 years ago
- ICFP Programming Contest 2011 repository☆24Jul 1, 2011Updated 14 years ago
- Hive I/O Library☆66Oct 28, 2021Updated 4 years ago
- a set of benchmarks to test Storm performance☆15Jul 23, 2016Updated 9 years ago
- "Functional Programming in Scala" exercises☆17Dec 5, 2014Updated 11 years ago