san089/Cloudera_Material

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/san089/Cloudera_Material)

san089 / Cloudera_Material

Cloudera_Material: Study Material to help people preparing for Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collaborate.

☆42

Alternatives and similar repositories for Cloudera_Material

Users that are interested in Cloudera_Material are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Venkata09 / BigDataCertificationPrep
View on GitHub
All Certification and preparation, examples & others
☆11Oct 18, 2018Updated 7 years ago
okmich / cca175notes
View on GitHub
Preparatory notes for the Cloudera Spark and Hadoop Certification
☆18Dec 5, 2018Updated 7 years ago
san089 / Big_Data_Project
View on GitHub
Fake News Detection - Feature Extraction using Vectorization such as Count Vectorizer, TFIDF Vectorizer, Hash Vectorizer,. Then used an E…
☆23Feb 21, 2020Updated 6 years ago
Pushkr / Apache-Spark-Hands-On
View on GitHub
Educational notes,Hands on problems w/ solutions for hadoop ecosystem
☆87Jan 22, 2019Updated 7 years ago
Prakash-Ponnusamy1 / CCA175_Master_Preparation
View on GitHub
☆19Apr 9, 2020Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
san089 / data-engineer-roadmap
View on GitHub
Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups
☆18Sep 17, 2018Updated 7 years ago
leandrohmvieira / databricks-crt020-notes
View on GitHub
docs, codes and resources to prepare for the CRT020: Databricks Certified Associate Developer for Apache Spark 2.4 with Python 3 certific…
☆10Sep 25, 2019Updated 6 years ago
DrakeData / data-engineer-nanodegree
View on GitHub
This is a repository for my data engineer course through Udacity.
☆15Oct 18, 2019Updated 6 years ago
khurramturk / CloudAge
View on GitHub
Hadoop Scripts
☆15Mar 22, 2016Updated 10 years ago
mohamedYoussfi / deeplearning4j-cnn-mnist-app
View on GitHub
☆12Jul 28, 2019Updated 6 years ago
iaasacademy / aws-solutions-architect-associate
View on GitHub
Code samples for the AWS Solutions Architect Associate Course
☆15Jun 18, 2026Updated last month
carsonskjerdal / GroceryShop
View on GitHub
A grocery buying app
☆12Jul 24, 2021Updated 5 years ago
cloudera-labs / cloudera-deploy
View on GitHub
A general purpose framework for automating Cloudera Products
☆70Mar 4, 2025Updated last year
onefoursix / kill-long-running-impala-queries
View on GitHub
☆16Nov 8, 2015Updated 10 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
VinayChaudhari1996 / pyspark-dataframe-made-easy
View on GitHub
pyspark dataframe made easy
☆16Dec 15, 2021Updated 4 years ago
ibm-wm-transition / WxMonitoring
View on GitHub
webMethods Monitoring using Open Elastic Stack
☆11Jun 27, 2025Updated last year
syedhassaanahmed / databricks-notebooks
View on GitHub
Collection of Databricks and Jupyter Notebooks
☆22Feb 9, 2026Updated 5 months ago
AleNegrini / CCA131-Required-Skills
View on GitHub
This repo is intended to share how I would face with the skills required by Cloudera
☆19Jul 9, 2017Updated 9 years ago
BenSchr / Udacity-Data-Engineering-Projects
View on GitHub
My solutions for the Udacity Data Engineering Nanodegree
☆34Oct 14, 2019Updated 6 years ago
antimoz-om / Antimoz
View on GitHub
A data engineering pipeline for digital marketers.
☆11Dec 21, 2018Updated 7 years ago
fahadhaidari / qKit
View on GitHub
qKit is an open source JavaScript 2D rendering library that renders quadratic shapes (rectangles and squares).
☆22May 21, 2019Updated 7 years ago
taylorty / Battle-City
View on GitHub
A Java implementation of video game Battle City
☆15Mar 20, 2018Updated 8 years ago
aseigneurin / spark-sandbox
View on GitHub
Experiments made with Spark
☆15Dec 9, 2014Updated 11 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
okmich / hadoop-training-projects
View on GitHub
Projects from my Hadoop training sessions
☆16Feb 22, 2018Updated 8 years ago
LeonardoEmili / stock-price-forecasting
View on GitHub
Distributed stock price forecasting system to predict S&P 500 stock prices.
☆11Nov 12, 2021Updated 4 years ago
chagaz / ml-notebooks
View on GitHub
Some machine learning notebooks
☆11Jul 10, 2025Updated last year
adityajain10 / pyspark-mlib-based-stock-predictor
View on GitHub
PredictorFinc is a scalable supervised machine learning model the predicts stock price change through Decision Tree Regressor using data …
☆12Sep 5, 2023Updated 2 years ago
g00glen00b / ng-spring-data
View on GitHub
Spring Data REST + AngularJS example
☆17Nov 29, 2014Updated 11 years ago
hellokoding / email-verification-springboot-mysql-nginx-dockercompose
View on GitHub
Email Verification with Spring Boot, MySQL, NGINX, Docker Compose
☆13Jan 6, 2019Updated 7 years ago
Marlowess / spark-exercises
View on GitHub
Some exercises to learn Spark. Solved in Python.
☆21Oct 15, 2024Updated last year
prakashdontaraju / google-cloud-ecommerce
View on GitHub
ecommerce GCP Streaming pipeline ― Cloud Storage, Compute Engine, Pub/Sub, Dataflow, Apache Beam, BigQuery and Tableau; GCP Batch pipelin…
☆11Mar 9, 2022Updated 4 years ago
minzhang-1 / PointHop-PointHop2_Spark
View on GitHub
A fast and low memory requirement version of PointHop and PointHop++, which is built upon Apache Spark.
☆10Jul 14, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mateuspicanco / project-atlas-sao-paulo
View on GitHub
A project for the development of rich geospatial data from the city of São Paulo for use in Machine Learning models.
☆12Jul 4, 2021Updated 5 years ago
ericbellet / databricks-certification
View on GitHub
Databricks Certified Associate Developer for Apache Spark 3.0
☆34Jul 27, 2020Updated 5 years ago
vsouza / spark-kinesis-redshift
View on GitHub
Example project for consuming AWS Kinesis streamming and save data on Amazon Redshift using Apache Spark
☆11May 22, 2018Updated 8 years ago
AWS-Big-Data-Projects / Run-a-Spark-job-within-Amazon-EMR
View on GitHub
Run a Spark job within Amazon EMR
☆12Sep 12, 2020Updated 5 years ago
TreeKat71 / 30DaysOfAirflow
View on GitHub
30 Days of Airflow
☆10Aug 13, 2019Updated 6 years ago
AWS-Big-Data-Projects / AWS-EMR
View on GitHub
Analyzing Big Data with Amazon EMR
☆12Sep 14, 2020Updated 5 years ago
cloudera-labs / cloudera.cluster
View on GitHub
An Ansible collection for Cloudera Platform for on-premise and cloud Datahubs
☆38Aug 26, 2025Updated 10 months ago