Data-Intensive Text Processing with MapReduce
☆628Mar 3, 2021Updated 5 years ago
Alternatives and similar repositories for MapReduceAlgorithms
Users that are interested in MapReduceAlgorithms are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for MapReduce Design Patterns (O'Reilly 2012) example source code☆234Jul 5, 2015Updated 10 years ago
- Spark Tutorial at the University of Maryland☆38Oct 24, 2014Updated 11 years ago
- ☆12Mar 31, 2021Updated 5 years ago
- MapReduce, Spark, Java, and Scala for Data Algorithms Book☆1,080Oct 14, 2024Updated last year
- Mirror of Apache Crunch (Incubating)☆110Feb 2, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Example application for analyzing Twitter data using CDH - Flume, Oozie, Hive☆289Aug 25, 2016Updated 9 years ago
- Examples for learning spark☆19Aug 19, 2015Updated 10 years ago
- Provides a simple archetype to create MapReduce jobs with Maven.☆24Dec 3, 2010Updated 15 years ago
- Spark + Jupyer + Hive☆12Sep 24, 2015Updated 10 years ago
- ☆22Sep 20, 2016Updated 9 years ago
- A scrapper that takes an online book from ORilley and turns into an epub book, because I want to read'em in my nook, away from my compute…☆25Sep 11, 2016Updated 9 years ago
- A course in numerical methods with Python for engineers and scientists: currently 5 learning modules, with student assignments.☆10Dec 6, 2017Updated 8 years ago
- Solution to the Higgs Boson Machine Learning Challenge on Kaggle☆32Sep 16, 2014Updated 11 years ago
- Ansible modules for interacting with Azure Resource Manager☆10Aug 16, 2016Updated 9 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- crumbling large graphs into connected components☆12Jan 8, 2018Updated 8 years ago
- ☆20Jun 29, 2022Updated 3 years ago
- Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University☆165Dec 4, 2025Updated 5 months ago
- Python Client for WebHDFS REST API☆43May 8, 2015Updated 11 years ago
- ☆15Aug 5, 2016Updated 9 years ago
- Generate word-word similarities from Gensim's latent semantic indexing (Python)☆11Jan 10, 2017Updated 9 years ago
- This is a port of the Google+ iPad app timeline purely done with CSS3☆88Aug 2, 2012Updated 13 years ago
- Convert a nested map to a flat map of sql-friendly columns☆16Oct 5, 2023Updated 2 years ago
- Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White☆3,506Mar 17, 2020Updated 6 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Workshop for Hadoop Operations Best Practices☆10Feb 24, 2015Updated 11 years ago
- A web scraping tutorial with Illinois Department of Revenue tax data☆26May 8, 2013Updated 13 years ago
- A pseudo3d old school racing game made with pure Javascript.☆58Jun 29, 2020Updated 5 years ago
- Source code that accompanies the book "Hadoop in Practice, Second Edition".☆80Sep 10, 2014Updated 11 years ago
- Cloud9 is a Hadoop toolkit for working with big data☆236Dec 15, 2015Updated 10 years ago
- Drone Tank Arena is a BattleZone-like FPS made in WebGL made during 7dfps contest.☆58Dec 2, 2016Updated 9 years ago
- Uses Node.js and Leap Motion to control an AR Drone and stream video to the browser.☆62Nov 20, 2013Updated 12 years ago
- Deprecated. Formerly: scripts to make it easier to set up and manipulate clusters at Amazon EC2☆110Jul 26, 2012Updated 13 years ago
- ☆29Jan 23, 2019Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- NoSQL y_serial Python module – warehouse compressed objects with SQLite☆17Jun 24, 2015Updated 10 years ago
- Code repository for O'Reilly Hadoop Application Architectures book☆163May 26, 2015Updated 10 years ago
- Vector Space Model Framework developed for InPhO☆39May 9, 2025Updated 11 months ago
- My website developed in pelican☆14Dec 26, 2025Updated 4 months ago
- Indri search implementation on top of Lucene search engine☆37Mar 12, 2024Updated 2 years ago
- Gates of Olympus: A multi-layer tower defense game in WebGL☆15Jan 30, 2011Updated 15 years ago
- A simple implementation of a pairs trading strategy☆13Mar 17, 2015Updated 11 years ago