This repository contains instructions for installing Hadoop on Google Colab and running MapReduce jobs in Hadoop
☆33Sep 11, 2020Updated 5 years ago
Alternatives and similar repositories for Hadoop
Users that are interested in Hadoop are comparing it to the libraries listed below.
- Power Plant ML Pipeline Application - Apache Spark☆12Dec 12, 2016Updated 9 years ago
- source code☆10Dec 13, 2023Updated 2 years ago
- A simple Transformer implementation in C and Python for educational purposes☆14Oct 30, 2024Updated last year
- A simple example of using LlamaIndex☆15May 24, 2023Updated 2 years ago
- Traditionally, engineers were needed to implement business logic via data pipelines before business users can start using it. Using this …☆12Mar 18, 2026Updated last week
- Cloud formation script for solr servers☆17Jul 1, 2015Updated 10 years ago
- A Python script that converts a YouTube URL to an MP3 file and downloads it to the same directory as the .py file.☆10May 20, 2025Updated 10 months ago
- ☆15May 28, 2022Updated 3 years ago
- A real-time, ChatGPT-based interview answering application. Listens to the system's audio output and generates answers in real time.☆17Apr 27, 2024Updated last year
- Integrating Apache Airflow, dbt, Great Expectations and Apache Superset to develop a modern open source data stack.☆16Jun 19, 2022Updated 3 years ago
- A study guide for preparing for the CDP Administrator Private Cloud Base Exam (CDP-2001)☆14May 25, 2023Updated 2 years ago
- QuizPortal is a MERN-based full-stack application 📝 curated to replicate and conduct online tests. 🌐 It is a platform where any quiz 🤔 can…☆12Jul 21, 2023Updated 2 years ago
- Code for the paper "FinRLlama: A Solution to LLM-Engineered Signals Challenge at FinRL Contest 2024"☆13Feb 14, 2025Updated last year
- CMU 15-712 lecture slides☆11Jan 6, 2020Updated 6 years ago
- ☆26Sep 4, 2018Updated 7 years ago
- Cours et TP sur Apache Spark☆12Feb 7, 2022Updated 4 years ago
- My solutions for the problem sets in the Udacity Intro to Hadoop and MapReduce course☆15Apr 17, 2014Updated 11 years ago
- ☆13Aug 14, 2025Updated 7 months ago
- ☆10Jul 20, 2020Updated 5 years ago
- ☆10Mar 22, 2021Updated 5 years ago
- ☆10Jan 27, 2025Updated last year
- End-to-End ELT data pipeline with Postgres, Airbyte, dbt, Dagster, Snowflake and Metabase☆11Jul 13, 2023Updated 2 years ago
- ☆12Sep 22, 2021Updated 4 years ago
- PDF DataSource for Apache Spark; allows reading PDF files directly into a DataFrame and running OCR on them☆80Apr 27, 2025Updated 11 months ago
- This repository provides a set of pre-configured settings to help you quickly set up and start using Obsidian☆17Jan 19, 2024Updated 2 years ago
- Solr Demo☆26Jan 14, 2019Updated 7 years ago
- Plan, design and implement enterprise data infrastructure solutions and create the blueprints for an organization’s data management syste…☆14Jun 25, 2023Updated 2 years ago
- A dashboard for book publishers created with React, Redux-Saga and Cloud-Firestore (Firebase)☆12Jun 30, 2019Updated 6 years ago
- Datagenerator for Data Services☆16Sep 29, 2025Updated 5 months ago
- Data Streaming Nanodegree (from Udacity) exercises, projects and their solutions☆17Aug 14, 2023Updated 2 years ago
- A Kafka Connect Single Message Transform (SMT) that enables you to append the record key to the value as a named field☆19Mar 18, 2026Updated last week
- A simple home-made way of getting AI to help you out during a remote interview☆11Nov 30, 2023Updated 2 years ago
- Personal notes taken while doing the Udacity Data Engineering Nanodegree☆13May 28, 2020Updated 5 years ago
- Biometric Attendance System built with computer vision (Python OpenCV), Flask and the MERN stack☆15Jan 12, 2023Updated 3 years ago
- Creating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.☆15Jun 26, 2023Updated 2 years ago
- 🎓 Harvard CS50x 2022 - problem sets solutions☆21Feb 18, 2026Updated last month
- We define and estimate smooth unique information of samples with respect to classifier weights and predictions. We compute these quantiti…☆11Mar 9, 2021Updated 5 years ago
- Demo KafkaJS application to notify Slack webhook on NPM package releases☆21Nov 4, 2020Updated 5 years ago
- Learn how to deploy and manage a data tier based on Apache Cassandra™ cluster in Kubernetes using K8ssandra.☆22Jan 20, 2023Updated 3 years ago