Hadoop (Utilities, Patches and Examples)
☆243 · Jun 21, 2016 · Updated 9 years ago
Alternatives and similar repositories for Hadoop
Users who are interested in Hadoop are comparing it to the repositories listed below.
- Hadoop Cluster Configurations ☆32 · Aug 5, 2021 · Updated 4 years ago
- Optimized joins using bloom filters on Hadoop via Cascading. ☆23 · Sep 25, 2009 · Updated 16 years ago
- ☆26 · Jan 2, 2024 · Updated 2 years ago
- Eclipse plugin for Apache Pig ☆33 · Jul 22, 2013 · Updated 12 years ago
- Contractive Auto-Encoders in Numpy ☆29 · Oct 17, 2012 · Updated 13 years ago
- Using Hadoop with Scala ☆70 · Oct 5, 2013 · Updated 12 years ago
- General utility code used across BDG products. Apache 2 licensed. ☆18 · Mar 17, 2026 · Updated last month
- Binary of pullcontainer ☆10 · Dec 12, 2014 · Updated 11 years ago
- Examples of distributed computation using Celery ☆33 · Feb 19, 2012 · Updated 14 years ago
- Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White ☆3,507 · Mar 17, 2020 · Updated 6 years ago
- A branch of the boilerpipe project ☆15 · Mar 18, 2011 · Updated 15 years ago
- Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives ☆16 · Jun 10, 2021 · Updated 4 years ago
- Open Source/Service libraries, examples, and experiments. ☆42 · Jul 13, 2009 · Updated 16 years ago
- Benchmark Python and Cython code ☆13 · Jun 13, 2014 · Updated 11 years ago
- Simple stacktrace analysis tool for the JVM ☆24 · Sep 8, 2017 · Updated 8 years ago
- A port of the base jQuery UI theme to LESS. ☆26 · May 29, 2011 · Updated 14 years ago
- A S3 hybrid storage interface for dat and hyperdrive ☆13 · Jul 31, 2018 · Updated 7 years ago
- DevOps for Serverless Applications, published by Packt ☆12 · Jan 18, 2023 · Updated 3 years ago
- ☆75 · Jun 18, 2013 · Updated 12 years ago
- Talks at the <Programming> 2022 Conference in Porto, Portugal ☆11 · Mar 30, 2022 · Updated 4 years ago
- Working example of consuming Avro data from Kafka with Spark Streaming ☆12 · Feb 21, 2016 · Updated 10 years ago
- Hands-on Microservices with Python [video], published by Packt ☆16 · Dec 8, 2022 · Updated 3 years ago
- ☆10 · Dec 3, 2025 · Updated 4 months ago
- NEW: see http://www.hops.io/. OLD: This work aims to re-engineer the Hadoop Distributed File System (HDFS) so that it can be 1) highly av… ☆26 · Jan 2, 2012 · Updated 14 years ago
- ☆11 · Dec 10, 2015 · Updated 10 years ago
- Python client for Elasticsearch Watcher (deprecated) ☆23 · Jun 4, 2018 · Updated 7 years ago
- Apache Hadoop ☆15,519 · Apr 10, 2026 · Updated last week
- RabbitMQ Federation Management ☆15 · Nov 16, 2020 · Updated 5 years ago
- Dockerfile to build image of Vertica Community Edition. ☆21 · Apr 5, 2017 · Updated 9 years ago
- Dockerfile for Apache Zeppelin ☆17 · Dec 9, 2015 · Updated 10 years ago
- A Python MapReduce and HDFS API for Hadoop ☆242 · Jan 19, 2026 · Updated 2 months ago
- Use word2vec embedding with LSTM for the "Bag of Words meets Bag of Popcorn" challenge ☆16 · May 12, 2017 · Updated 8 years ago
- Kirk's Zeppelin Notebooks ☆11 · May 22, 2018 · Updated 7 years ago
- Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside in HBase region servers ☆69 · Sep 17, 2025 · Updated 7 months ago
- Simple Structured Perceptron tagger in Python ☆10 · May 30, 2017 · Updated 8 years ago
- Utilities to use Avro files from Hadoop Map/Reduce jobs and Streaming ☆26 · Sep 10, 2013 · Updated 12 years ago
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase ☆14 · Mar 23, 2016 · Updated 10 years ago
- Docker image for Apache Zeppelin Created from Zeppelin base image to minimize traffic and deployment time in case of changes should be ap… ☆13 · Oct 23, 2018 · Updated 7 years ago
- Cloud9 is a Hadoop toolkit for working with big data ☆236 · Dec 15, 2015 · Updated 10 years ago