Cloudera_Material: Study Material to help people preparing for Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collaborate.
☆41Apr 21, 2020Updated 6 years ago
Alternatives and similar repositories for Cloudera_Material
Users that are interested in Cloudera_Material are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- All Certification and preparation, examples & others☆11Oct 18, 2018Updated 7 years ago
- Preparatory notes for the Cloudera Spark and Hadoop Certification☆18Dec 5, 2018Updated 7 years ago
- Fake News Detection - Feature Extraction using Vectorization such as Count Vectorizer, TFIDF Vectorizer, Hash Vectorizer,. Then used an E…☆22Feb 21, 2020Updated 6 years ago
- Educational notes,Hands on problems w/ solutions for hadoop ecosystem☆87Jan 22, 2019Updated 7 years ago
- ☆19Apr 9, 2020Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- docs, codes and resources to prepare for the CRT020: Databricks Certified Associate Developer for Apache Spark 2.4 with Python 3 certific…☆10Sep 25, 2019Updated 6 years ago
- A collection of data analysis projects done using PySpark via Jupyter notebooks.☆10Oct 8, 2022Updated 3 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆18Sep 17, 2018Updated 7 years ago
- This is a repository for my data engineer course through Udacity.☆16Oct 18, 2019Updated 6 years ago
- A general purpose framework for automating Cloudera Products☆69Mar 4, 2025Updated last year
- Udacity Data Engineering Nanodegree Capstone Project☆36May 9, 2020Updated 6 years ago
- pyspark dataframe made easy☆16Dec 15, 2021Updated 4 years ago
- Cloudera CCA175 Spark and Hadoop Developer exam preparation☆16Jan 19, 2018Updated 8 years ago
- This repo is intended to share how I would face with the skills required by Cloudera☆20Jul 9, 2017Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Examples of Using DBTunnel☆11Apr 24, 2024Updated 2 years ago
- Collection of Databricks and Jupyter Notebooks☆22Feb 9, 2026Updated 3 months ago
- webMethods Monitoring using Open Elastic Stack☆11Jun 27, 2025Updated 10 months ago
- Code for tutorial on my blog http://taywils.me/2013/11/05/javasparkframeworktutorial/☆22Aug 26, 2017Updated 8 years ago
- Local Development of AWS Glue with Docker and Visual Studio Code☆14Nov 29, 2021Updated 4 years ago
- My solutions for the Udacity Data Engineering Nanodegree☆34Oct 14, 2019Updated 6 years ago
- Solutions of LeetCode interview questions☆15Feb 7, 2019Updated 7 years ago
- ☆13Jan 25, 2024Updated 2 years ago
- Experiments made with Spark☆15Dec 9, 2014Updated 11 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Rasa Chatbot using Django backend and Sockets for communication☆12Dec 8, 2022Updated 3 years ago
- qKit is an open source JavaScript 2D rendering library that renders quadratic shapes (rectangles and squares).☆22May 21, 2019Updated 7 years ago
- ☆14Sep 14, 2021Updated 4 years ago
- Ejemplos y ejercicios del curso de Kubernetes de CodeURJC☆24Oct 13, 2020Updated 5 years ago
- ☆18Apr 28, 2018Updated 8 years ago
- A data engineering pipeline for digital marketers.☆11Dec 21, 2018Updated 7 years ago
- Email Verification with Spring Boot, MySQL, NGINX, Docker Compose☆13Jan 6, 2019Updated 7 years ago
- A fast and low memory requirement version of PointHop and PointHop++, which is built upon Apache Spark.☆10Jul 14, 2020Updated 5 years ago
- A project for the development of rich geospatial data from the city of São Paulo for use in Machine Learning models.☆11Jul 4, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Demo showcasing Spark Streaming, Kafka, Kudu - all in Python☆27Jun 12, 2017Updated 8 years ago
- 30 Days of Airflow☆10Aug 13, 2019Updated 6 years ago
- Databricks Certified Associate Developer for Apache Spark 3.0☆31Jul 27, 2020Updated 5 years ago
- Example project for consuming AWS Kinesis streamming and save data on Amazon Redshift using Apache Spark☆11May 22, 2018Updated 8 years ago
- Run a Spark job within Amazon EMR☆12Sep 12, 2020Updated 5 years ago
- End-to-End deployment of E-commerce customers segmentation using Clustering Machine learning algorithms in Google Cloud Platform and MLOp…☆19Jun 5, 2024Updated last year
- Crop Yield Prediction with Deep Learning☆20May 31, 2019Updated 6 years ago