matteobertozzi/Hadoop

Hadoop (Utilities, Patches and Examples)

☆240

Alternatives and similar repositories for Hadoop

Users who are interested in Hadoop are comparing it to the repositories listed below.

netxillon / Hadoop
Hadoop Cluster Configurations
☆32 · Aug 5, 2021 · Updated 4 years ago
nathanmarz / cascading-batch-query
Optimized joins using bloom filters on Hadoop via Cascading.
☆22 · Sep 25, 2009 · Updated 16 years ago
jpfuentes2 / swim
Haskell implementation of the SWIM epidemic gossip protocol
☆12 · Apr 10, 2019 · Updated 7 years ago
ofermend / practical-data-science-with-hadoop-and-spark
☆26 · Jan 2, 2024 · Updated 2 years ago
chef-boneyard / omnibus
Prepares a machine to be an Omnibus builder. ┬──┬◡ﾉ(° -°ﾉ)
☆27 · Oct 20, 2020 · Updated 5 years ago
codius-deprecated / codius-cli
Codius command-line interface (CLI) for Node.js
☆19 · Mar 6, 2015 · Updated 11 years ago
romainr / PigEditor
Eclipse plugin for Apache Pig
☆33 · Jul 22, 2013 · Updated 12 years ago
deanwampler / scala-hadoop
Using Hadoop with Scala
☆70 · Oct 5, 2013 · Updated 12 years ago
jperla / pullcontainer-binary
Binary of pullcontainer
☆10 · Dec 12, 2014 · Updated 11 years ago
chef-boneyard / delivery-cluster
DEPRECATED: Deployment cookbook for standing up Delivery clusters using chef-provisioning.
☆19 · May 31, 2017 · Updated 9 years ago
alextp / pylda
An implementation of Gibbs sampling for Latent Dirichlet Allocation
☆30 · Aug 3, 2011 · Updated 14 years ago
larsbutler / celery-examples
Examples of distributed computation using Celery
☆33 · Feb 19, 2012 · Updated 14 years ago
tomwhite / hadoop-book
Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White
☆3,500 · Mar 17, 2020 · Updated 6 years ago
mesosphere-backup / mesos-slave-dind
Mesos Slave with Docker-in-Docker
☆12 · May 7, 2018 · Updated 8 years ago
zooie / opensearch
Open Source/Service libraries, examples, and experiments.
☆42 · Jul 13, 2009 · Updated 16 years ago
joshdevins / storm-kafka
Library to use Kafka as a spout within Storm
☆43 · Sep 26, 2011 · Updated 14 years ago
mentat-collective / Mafs.cljs
Reagent interface to the Mafs interactive 2d math visualization library.
☆15 · Jun 1, 2024 · Updated 2 years ago
webrecorder / dat-s3-hybrid-storage
An S3 hybrid storage interface for dat and hyperdrive
☆13 · Jul 31, 2018 · Updated 7 years ago
jed / hyperspider
A declarative HATEOAS API crawler for node.js
☆113 · Sep 25, 2012 · Updated 13 years ago
ymc-geko / ansible-cdh-cluster
Install Cloudera's distribution of Hadoop including Cloudera Manager and Cloudera Search (Beta)
☆32 · Aug 16, 2013 · Updated 12 years ago
gwenshap / sqoop2hive
A few scripts to automate daily data loads from RDBMS to Partitioned Avro Hive table
☆30 · Sep 25, 2014 · Updated 11 years ago
cloudera / python-ngrams
☆74 · Jun 18, 2013 · Updated 13 years ago
pandoraui / jquery-chm
jQuery online manual
☆10 · Sep 25, 2015 · Updated 10 years ago
seanpquig / confluent-platform-spark-streaming
Working example of consuming Avro data from Kafka with Spark Streaming
☆12 · Feb 21, 2016 · Updated 10 years ago
smarzola / anypubsub
A generic interface wrapping multiple backends to provide a consistent pubsub API
☆13 · Oct 31, 2018 · Updated 7 years ago
lalithsuresh / Scaling-HDFS-NameNode
NEW: see http://www.hops.io/. OLD: This work aims to re-engineer the Hadoop Distributed File System (HDFS) so that it can be 1) highly av…
☆26 · Jan 2, 2012 · Updated 14 years ago
gbraccialli / SparkUtils
☆11 · Dec 10, 2015 · Updated 10 years ago
apache / hadoop
Apache Hadoop
☆15,591 · Updated this week
ipedrazas / Zeppelin-docker
Dockerfile for Apache Zeppelin
☆17 · Dec 9, 2015 · Updated 10 years ago
webrecorder / markdown-to-respec
A GitHub Action for turning Markdown into ReSpec HTML
☆16 · Jun 6, 2024 · Updated 2 years ago
crs4 / pydoop
A Python MapReduce and HDFS API for Hadoop
☆241 · Jan 19, 2026 · Updated 5 months ago
google / casfs
Content-addressable storage, implemented over pyfilesystem2.
☆17 · Jun 17, 2020 · Updated 6 years ago
Yelp / mrjob
Run MapReduce jobs on Hadoop or Amazon Web Services
☆2,611 · Apr 2, 2026 · Updated 3 months ago
kirkhas / zeppelin-notebooks
Kirk's Zeppelin Notebooks
☆11 · May 22, 2018 · Updated 8 years ago
frol / python-tutorials
My personal tutorials to dive into Python in an hour or so
☆10 · Jul 15, 2016 · Updated 9 years ago
webrecorder / oembed.link
A Cloudflare Worker to render embeds on a single page using oEmbed
☆25 · Nov 17, 2022 · Updated 3 years ago
tomslabs / avro-utils
Utilities to use Avro files from Hadoop Map/Reduce jobs and Streaming
☆26 · Sep 10, 2013 · Updated 12 years ago
randerzander / HiveToPhoenix
An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase
☆14 · Mar 23, 2016 · Updated 10 years ago
IngloriousCoderz / react-property-grid
A react/redux implementation of an editable property grid.
☆10 · Jun 1, 2017 · Updated 9 years ago