Python MapReduce library written in Cython. Visit us in #hadoopy on freenode. See the link below for documentation and tutorials.
☆241Jan 8, 2016Updated 10 years ago
Alternatives and similar repositories for hadoopy
Users that are interested in hadoopy are comparing it to the libraries listed below
Sorting:
- Example code for "Web-Scale Computer Vision using MapReduce for Multimedia Data Mining"☆49Aug 2, 2010Updated 15 years ago
- Python module that allows one to easily write and run Hadoop programs.☆1,032Jan 9, 2018Updated 8 years ago
- A Python MapReduce and HDFS API for Hadoop☆242Jan 19, 2026Updated last month
- Utilities to use Avro files from Hadoop Map/Reduce jobs and Streaming☆26Sep 10, 2013Updated 12 years ago
- Library for GPU-related statistical functions☆84Oct 16, 2012Updated 13 years ago
- A pure python HDFS client☆859Apr 19, 2022Updated 3 years ago
- A Python module for dealing with so called "typed bytes".☆31Dec 6, 2011Updated 14 years ago
- vertical search crawler☆38Jan 9, 2012Updated 14 years ago
- NEW: see http://www.hops.io/. OLD: This work aims to re-engineer the Hadoop Distributed File System (HDFS) so that it can be 1) highly av…☆26Jan 2, 2012Updated 14 years ago
- A simple benchmark of noSQL databases for both read/update and MapReduce performances☆32May 14, 2011Updated 14 years ago
- Run MapReduce jobs on Hadoop or Amazon Web Services☆2,617Mar 24, 2023Updated 2 years ago
- PySOM - The Simple Object Machine Smalltalk implemented in Python☆19Aug 19, 2025Updated 6 months ago
- Nearest Neighbor Search in High Dimensional Spaces☆13Nov 18, 2015Updated 10 years ago
- an easy blog hosted at redhat openshift☆16May 28, 2013Updated 12 years ago
- Oozie - workflow engine for Hadoop☆374Jun 8, 2017Updated 8 years ago
- ☆17Mar 6, 2012Updated 13 years ago
- sqlite-backed dictionary conforming to the dbm interface☆26Dec 16, 2012Updated 13 years ago
- SQL Windowing Functions for Hadoop☆65Jun 20, 2022Updated 3 years ago
- Bokeh tutorial, PyData Berlin☆10May 29, 2015Updated 10 years ago
- simple lightweight Linux cron daemon☆10Nov 1, 2016Updated 9 years ago
- Redis bulk-loader for Apache Pig☆40Apr 21, 2012Updated 13 years ago
- GoldenOrb is an open-source implementation of Pregel, Google's graph processing framework☆294Jun 29, 2022Updated 3 years ago
- Latent Dirichlet Allocation with Gibbs sampling☆16Dec 18, 2013Updated 12 years ago
- SOM++ - C++ implementation of the Simple Object Machine Smalltalk☆13Aug 23, 2025Updated 6 months ago
- Showing the relationship between ImageNet ID and labels and pytorch pre-trained model output ID and labels☆10Oct 11, 2020Updated 5 years ago
- A Python package for visualizing 1d and 2d NumPy arrays☆18Dec 31, 2015Updated 10 years ago
- A stylesheet for rst2html5.py☆11Jun 29, 2015Updated 10 years ago
- Uses TF-IDF and inverted search to cluster search results☆22Mar 10, 2011Updated 14 years ago
- Detect duplicated items framework。内容排重框架。☆12Apr 30, 2015Updated 10 years ago
- ctypes bindings for libphash to robustly compare media files