This repository gathers a curated list of free resources for learning Hadoop.
☆57 · Jun 10, 2018 · Updated 7 years ago
Alternatives and similar repositories for Learn-Hadoop-and-Spark
Users who are interested in Learn-Hadoop-and-Spark are comparing it to the libraries listed below.
- A bridge to Apache Atlas for provenance metadata created in course of using Apache NiFi ☆15 · Jan 2, 2023 · Updated 3 years ago
- This repository focuses on saving my linkedin articles and stuff that I find "USEFUL" on LinkedIn. ☆156 · Jan 18, 2023 · Updated 3 years ago
- POC for all the stack of big data (kafka, spark, cassandra, hdfs, docker, springboot) ☆12 · Dec 16, 2022 · Updated 3 years ago
- Deep Q-Networks in tensorflow ☆10 · Apr 4, 2017 · Updated 9 years ago
- Drafts and ideas for my blog ☆14 · Oct 28, 2024 · Updated last year
- Source code for 'Up and Running with DAX for Power BI' by Alison Box ☆12 · Jun 10, 2022 · Updated 3 years ago
- Educational notes, Hands on problems w/ solutions for hadoop ecosystem ☆87 · Jan 22, 2019 · Updated 7 years ago
- The goal of this project is to analyse the impact of Covid-19 on the Aviation industry through data engineering processes using technolog… ☆13 · Jun 26, 2022 · Updated 3 years ago
- Learning PySpark video series ☆11 · Mar 5, 2018 · Updated 8 years ago
- ☆18 · Aug 15, 2022 · Updated 3 years ago
- Apache Spark using SQL ☆14 · Aug 18, 2021 · Updated 4 years ago
- Contains all scripts and data of blog contents ☆10 · May 3, 2017 · Updated 8 years ago
- Displays current weather conditions inside and out ☆12 · Aug 3, 2015 · Updated 10 years ago
- This repo consists of all important concepts for data engineers. ☆11 · Dec 24, 2024 · Updated last year
- Self-contained examples using Apache Spark with the functional features of Java 8 ☆66 · Apr 8, 2018 · Updated 8 years ago
- This repository is to help with the Partner Demonstration of the Apache Atlas project. ☆30 · Oct 29, 2015 · Updated 10 years ago
- Apache Spark Guide ☆35 · Feb 1, 2022 · Updated 4 years ago
- Gobblin is a distributed big data integration framework (ingestion, replication, compliance, retention) for batch and streaming systems.… ☆11 · Jul 29, 2017 · Updated 8 years ago
- ☆13 · Feb 18, 2022 · Updated 4 years ago
- MOVED: now at https://opendev.org/x/iotronic ☆17 · Sep 26, 2019 · Updated 6 years ago
- Terraform Module to create an Apache Zookeeper cluster on AWS ☆13 · Jan 3, 2022 · Updated 4 years ago
- ☆14 · May 16, 2019 · Updated 6 years ago
- Ambari service for RedHat FreeIPA ☆11 · Sep 30, 2016 · Updated 9 years ago
- Botoflow is an asynchronous framework for Amazon SWF that helps you build SWF applications using Python ☆13 · Dec 26, 2022 · Updated 3 years ago
- Resilient Automation Functions and Scripts ☆15 · Jan 5, 2022 · Updated 4 years ago
- List of playbooks to manage Ambari ☆13 · Oct 3, 2018 · Updated 7 years ago
- A Python script to swoop and decrypt passwords from Chrome's local storage. ☆11 · Dec 10, 2018 · Updated 7 years ago
- ☆18 · Nov 16, 2018 · Updated 7 years ago
- Grafana Prometheus exporter ☆10 · Oct 17, 2017 · Updated 8 years ago
- Building pipeline to process the real-time data using Spark and Mongodb. ☆12 · Oct 30, 2019 · Updated 6 years ago
- Implement common statistical machine learning algorithms with raw Numpy. ☆16 · Jun 30, 2020 · Updated 5 years ago
- SQL on HBase with Apache Phoenix in Docker ☆29 · Mar 21, 2016 · Updated 10 years ago
- Avro Schema Shredder is a REST API that enables storage of Avro Schemas in Apache Atlas. This API enables an organization to use Apache A… ☆13 · Jan 11, 2017 · Updated 9 years ago
- Teaching notes from my Advanced SQL workshops as local lead instructor at General Assembly New York. The first edition was created for th… ☆18 · Feb 14, 2020 · Updated 6 years ago
- Python scripts for Agisoft Photoscan ☆12 · Jun 18, 2015 · Updated 10 years ago
- Packer Template to build an AWS Apache Cassandra AMI ☆10 · Jan 3, 2022 · Updated 4 years ago
- Various data stream/batch process demo with Apache Scala Spark 🚀 ☆12 · Feb 28, 2020 · Updated 6 years ago
- Adds a framework to enable Natural Language interactions in your Hubot scripts ☆11 · Dec 6, 2016 · Updated 9 years ago
- We store attacks and exploits that we've found useful in our research ☆13 · Jun 4, 2015 · Updated 10 years ago