This contain how to install Hadoop on google colab and how to run map-reduce in Hadoop
☆33Sep 11, 2020Updated 5 years ago
Alternatives and similar repositories for Hadoop
Users that are interested in Hadoop are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Traditionally, engineers were needed to implement business logic via data pipelines before business users can start using it. Using this …☆12Apr 7, 2026Updated last week
- GraphQL to SPARQL bridge☆24Feb 9, 2022Updated 4 years ago
- Run Local Kafka and Kpow with Docker Compose☆19May 29, 2025Updated 10 months ago
- Cloud formation script for solr servers☆17Jul 1, 2015Updated 10 years ago
- A python script to convert your youtube URL to an mp3 file and download it to the same directory as the .py file.☆10May 20, 2025Updated 10 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Improving Low-Resource Neural Machine Translation of Related Languages by Transfer Learning☆19Nov 3, 2022Updated 3 years ago
- <Hello!> -|DaS.Algo|- +Pr0bl3m5! `Comp::Ete` {Solve~Fun} ^Join_us^☆12Jan 31, 2025Updated last year
- Production Setup of Containerized NiFi Cluster (3 nodes)☆14Apr 2, 2022Updated 4 years ago
- Build a semantic search application with deep learning models.☆15Dec 3, 2024Updated last year
- Integrating Apache Airflow, dbt, Great Expectations and Apache Superset to develop a modern open source data stack.☆16Jun 19, 2022Updated 3 years ago
- ☆11Nov 21, 2023Updated 2 years ago
- Real-time Credit card Fraud detection using Spark Streaming, Spark ML, Spark SQL, Kafka, Cassandra and Airflow☆11Jul 1, 2022Updated 3 years ago
- ☆11Feb 24, 2022Updated 4 years ago
- This workshop will familiarize you with some of the key steps towards building an autonomous driving data lake and extracting images from…☆10Jul 12, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆21Sep 6, 2020Updated 5 years ago
- 🧠 A curated list of awesome ChatGPT resources, including libraries, SDKs, APIs, and more. 🌟 Please consider supporting this project by …☆20Mar 6, 2023Updated 3 years ago
- Cours et TP sur Apache Spark☆12Feb 7, 2022Updated 4 years ago
- My solutions for the problem sets in the Udacity Intro to Hadoop and MapReduce course☆15Apr 17, 2014Updated 12 years ago
- Run Impala in a Docker container.☆16Jan 5, 2019Updated 7 years ago
- ☆10Jul 20, 2020Updated 5 years ago
- ☆10Mar 22, 2021Updated 5 years ago
- distributed computing toolkit in rust☆22Sep 21, 2018Updated 7 years ago
- End-to-End ELT data pipeline with Postgres, Airbyte, dbt, Dagster, Snowflake and Metabase☆11Jul 13, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆10Jan 27, 2025Updated last year
- ☆20Jan 5, 2025Updated last year
- Db2 JDBC connector for Trino☆19Jan 6, 2023Updated 3 years ago
- PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it☆81Apr 27, 2025Updated 11 months ago
- Solr Demo☆26Jan 14, 2019Updated 7 years ago
- This repository provides a set of pre-configured settings to help you quickly set up and start using Obsidian☆17Jan 19, 2024Updated 2 years ago
- Simple, extensible implementations of some meta-learning algorithms in Jax☆11Oct 6, 2020Updated 5 years ago
- Data Streaming Nanodegree (from Udacity) exercises, projects and their solutions☆17Aug 14, 2023Updated 2 years ago
- A Kafka Connect Single Message Transform (SMT) that enables you to append the record key to the value as a named field☆19Mar 18, 2026Updated 3 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- AxonOps™ Workbench for Apache Cassandra® - Desktop application for Mac, Windows and Linux☆23Updated this week
- reating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.☆15Jun 26, 2023Updated 2 years ago
- An Ansible collection for Cloudera Platform for cloud and Data Services☆21Apr 6, 2026Updated last week
- Paper Collection for Batch RL with brief introductions.☆85Feb 26, 2022Updated 4 years ago
- Demo KafkaJS application to notify Slack webhook on NPM package releases☆21Nov 4, 2020Updated 5 years ago
- the benchmark for finance☆11Jul 4, 2023Updated 2 years ago
- Learn how to deploy and manage a data tier based on Apache Cassandra™ cluster in Kubernetes using K8ssandra.☆22Jan 20, 2023Updated 3 years ago