This repository gathers a curated list of resources to learn Hadoop for FREE.
☆57 · Jun 10, 2018 · Updated 7 years ago
Alternatives and similar repositories for Learn-Hadoop-and-Spark
Users that are interested in Learn-Hadoop-and-Spark are comparing it to the libraries listed below.
- A bridge to Apache Atlas for provenance metadata created in course of using Apache NiFi☆15Jan 2, 2023Updated 3 years ago
- Extract, transform, and load data for analytic processing using AWS Glue☆17May 2, 2021Updated 5 years ago
- A consumer of a Kafka topic based on Flink☆12Oct 5, 2022Updated 3 years ago
- Full-Text Search System to stream, collect, clean, store and filter data collected from different sources using Docker, Kafka, Elasticsea…☆21Jan 7, 2023Updated 3 years ago
- Base code for FastAPI, MongoDB, and machine learning with a Poetry env☆13Nov 4, 2023Updated 2 years ago
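- [易车 (Yiche)] Demos for Spark, Flink, HBase, Hive, and Flume that integrate some of Hadoop's native APIs (such as HDFS and MapReduce; currently only these two); also tests some exception-handling behavior☆16Apr 4, 2019Updated 7 years ago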
- Drafts and ideas for my blog☆14Oct 28, 2024Updated last year
- Collection of Pig scripts that I use for my talks and workshops☆39Apr 30, 2013Updated 13 years ago
- Source code for 'Up and Running with DAX for Power BI' by Alison Box☆12Jun 10, 2022Updated 3 years ago
- Workshop materials for a June 2017 workshop on microservices, Docker & Kubernetes, Node.JS, Kafka, Redis and choreography☆15Jun 1, 2017Updated 9 years ago
- Educational notes, hands-on problems w/ solutions for Hadoop ecosystem☆87Jan 22, 2019Updated 7 years ago
- Workshop for demonstrating how to rapidly build, test, and deploy a Spring Boot application on Kubernetes☆12Feb 5, 2020Updated 6 years ago
- Big Data Resources and References☆13Sep 4, 2024Updated last year
- Learning PySpark video series☆11Mar 5, 2018Updated 8 years ago
- ☆18Aug 15, 2022Updated 3 years ago
- Apache Spark using SQL☆14Aug 18, 2021Updated 4 years ago
- ☆10May 18, 2019Updated 7 years ago
- A Hubot script for creating quick reminders through natural language.☆11Jun 29, 2017Updated 8 years ago
- ☆13Dec 12, 2017Updated 8 years ago
- Apache Spark Guide☆38Feb 1, 2022Updated 4 years ago
- Terraform Module to create an Apache Zookeeper cluster on AWS☆13Jan 3, 2022Updated 4 years ago
- Ambari service for RedHat FreeIPA☆11Sep 30, 2016Updated 9 years ago
- Collection of best practices for Java persistence performance in Spring Boot applications☆10Nov 27, 2019Updated 6 years ago
- Resilient Automation Functions and Scripts☆15Jan 5, 2022Updated 4 years ago
- List of playbooks to manage Ambari☆13Oct 3, 2018Updated 7 years ago
- Run dynamic SQL in SQL. This package allows queries with an unknown number of select-list items and can solve challenging problems like d…☆12Oct 5, 2024Updated last year
- Grafana Prometheus exporter☆10Oct 17, 2017Updated 8 years ago
- Code used in the experiments of the paper COEGAN: Evaluating the Coevolution Effect in Generative Adversarial Networks http://gecco-2019.…☆14Jul 6, 2023Updated 2 years ago
- Avro Schema Shredder is a REST API that enables storage of Avro Schemas in Apache Atlas. This API enables an organization to use Apache A…☆13Jan 11, 2017Updated 9 years ago
- Packer Template to build an AWS Apache Cassandra AMI☆10Jan 3, 2022Updated 4 years ago
- Adds a framework to enable Natural Language interactions in your Hubot scripts☆11Dec 6, 2016Updated 9 years ago
- We store attacks and exploits that we've found useful in our research☆13Jun 4, 2015Updated 11 years ago
- Custom Alerts for Ambari server☆12Jul 27, 2015Updated 10 years ago
- openEHR Clinical modelling tooling setup☆10Jun 24, 2018Updated 7 years ago
- This repo contains commands that data engineers use in day to day work.☆61Feb 4, 2023Updated 3 years ago
- Subscriber Registry API and SIP Authentication Server☆18Jul 13, 2016Updated 9 years ago
- Reactive monitoring dashboard☆11Oct 13, 2020Updated 5 years ago
- Resources for Data Science Kick Starter Workshop at ODSC India 2019☆19Jul 4, 2020Updated 5 years ago
- Notes and task code for the Cloudera / Udacity Hadoop course☆16Jul 31, 2015Updated 10 years ago