Repository for MapReduce Design Patterns (O'Reilly 2012) example source code
☆234Jul 5, 2015Updated 10 years ago
Alternatives and similar repositories for mapreducepatterns
Users that are interested in mapreducepatterns are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Provides a simple archetype to create MapReduce jobs with Maven.☆24Dec 3, 2010Updated 15 years ago
- MapReduce by examples☆99Apr 16, 2019Updated 7 years ago
- MapReduce Demo☆396Apr 8, 2016Updated 10 years ago
- Data-Intensive Text Processing with MapReduce☆628Mar 3, 2021Updated 5 years ago
- Page Rank, Inverted Index and Matrix Multiplication☆10May 23, 2017Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- MapReduce, Spark, Java, and Scala for Data Algorithms Book☆1,081Oct 14, 2024Updated last year
- Code repository for Java Data Science Cookbook, published by Packt☆25Jan 30, 2023Updated 3 years ago
- Will come later...☆20Jul 1, 2022Updated 3 years ago
- JMXTrans configuration for hadoop/cassandra/zookeeper☆31Dec 3, 2015Updated 10 years ago
- Hadoop3.2 single/cluster mode with web terminal gotty, spark, jupyter pyspark, hive, eco etc.☆11Nov 7, 2019Updated 6 years ago
- A skills challenge for hiring!☆12Dec 21, 2016Updated 9 years ago
- Naive K-Means clustering with MapReduce☆20Dec 10, 2021Updated 4 years ago
- Tools to build knowledge graphs from multi-modal extractions☆12Apr 2, 2020Updated 6 years ago
- Python wrapper for the hadoop WebHDFS Rest API☆32Apr 11, 2015Updated 11 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Some extensions to Flume to help with collecting logs and storing as Avro.☆17Feb 22, 2014Updated 12 years ago
- A kafka source & sink for flume☆72Dec 10, 2013Updated 12 years ago
- Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White☆3,507Mar 17, 2020Updated 6 years ago
- JIT compiler from scratch, derived from Nick Desaulniers' great work☆12Oct 24, 2016Updated 9 years ago
- Lock tailing on your rotating files☆12Dec 4, 2019Updated 6 years ago
- A super simple utility for testing Apache Hive scripts locally for non-Java developers.☆73Feb 11, 2017Updated 9 years ago
- Example of basic Storm topology that updates DB persistent state☆33Feb 5, 2014Updated 12 years ago
- ☆20Apr 27, 2012Updated 13 years ago
- parse structed data from btc binary file☆15Sep 27, 2017Updated 8 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A simple implementation of Microsoft's AdPredictor (http://bit.ly/SFgcq8) in Python☆92Dec 26, 2013Updated 12 years ago
- Customer Product search clicks analytics using big data Hadoop, Hive, Oozie, ElasticSearch, Akka, Spring Data☆73Oct 5, 2022Updated 3 years ago
- 清华大数据作业MapReduce处理几百个G的JSON数据☆50Jun 27, 2016Updated 9 years ago
- GraphQL for PostGIS☆19Jul 15, 2015Updated 10 years ago
- Working example of consuming Avro data from Kafka with Spark Streaming☆12Feb 21, 2016Updated 10 years ago
- Elijah OpenStack integration☆27Jan 4, 2019Updated 7 years ago
- 7th in a competition organised by ICT☆24Dec 23, 2015Updated 10 years ago
- Deep Dive into Apache Spark 深入研读Spark源码☆260Jan 5, 2017Updated 9 years ago
- Meta-repository of big data tools -- source and essential plugins for hadoop, pig, wukong, storm, kafka etc.☆30Jun 29, 2014Updated 11 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Source code to accompany the book "Hadoop in Practice", published by Manning.☆204Feb 11, 2020Updated 6 years ago
- Hyperparameters-Optimization☆17Nov 22, 2025Updated 4 months ago
- Few scripts to automate daily data loads from RDBMS to Partitioned Avro Hive table☆30Sep 25, 2014Updated 11 years ago
- stats by ngx-lua☆15Aug 31, 2013Updated 12 years ago
- Examples of Spark 3.0☆45Nov 11, 2020Updated 5 years ago
- A trading demo application☆16Aug 28, 2013Updated 12 years ago
- K-Means Clustering using MapReduce☆74May 20, 2022Updated 3 years ago