(These solutions were tested on a 4-node Hortonworks cluster on my laptop. Do not run them in your production environment until you have tested them yourself. :)
☆20 · Apr 18, 2020 · Updated 5 years ago
Alternatives and similar repositories for -CCP-Data-Engineer---preparation
Users that are interested in -CCP-Data-Engineer---preparation are comparing it to the libraries listed below.
- Different ways to connect to storage in Azure Databricks ☆11 · Jul 19, 2019 · Updated 6 years ago
- Pyspark Spotify ETL ☆17 · Aug 19, 2021 · Updated 4 years ago
- ☆11 · Nov 29, 2020 · Updated 5 years ago
- An end-to-end Recommendation System built on Azure Databricks ☆56 · Jul 29, 2019 · Updated 6 years ago
- List customize [dot] files config. ☆11 · May 14, 2025 · Updated 11 months ago
- Spark UDFs to deserialize Avro messages with schemas stored in Schema Registry. ☆20 · Jan 11, 2018 · Updated 8 years ago
- ☆10 · Aug 2, 2021 · Updated 4 years ago
- Classification problem to predict loan defaulters using Lending Club Dataset ☆11 · Jan 26, 2019 · Updated 7 years ago
- Mass Suricata rules creator, from a list of domain ☆14 · Sep 14, 2018 · Updated 7 years ago
- ☆11 · Feb 11, 2020 · Updated 6 years ago
- native Rust implementation of Kafka protocol and api ☆14 · Jun 13, 2023 · Updated 2 years ago
- Preparatory notes for the Cloudera Spark and Hadoop Certification ☆18 · Dec 5, 2018 · Updated 7 years ago
- Simple wrapper over SOLR to emulate Azure Search (for development only) ☆12 · Jul 8, 2017 · Updated 8 years ago
- A simple php toolbox to interact with the Microsoft Azure Search Service REST API. ☆11 · Feb 2, 2023 · Updated 3 years ago
- Rust And Delta Demo. Explanation and walkthrough on delta-rs ☆10 · Aug 21, 2023 · Updated 2 years ago
- A GameBoy Emulator written in Rust, written as a learning project for both ☆10 · Jun 6, 2023 · Updated 2 years ago
- ☆10 · May 24, 2021 · Updated 4 years ago
- Examples of all Machine Learning Algorithm in Apache Spark ☆15 · Nov 2, 2017 · Updated 8 years ago
- Data Engineering Project at Insight ☆15 · Nov 17, 2015 · Updated 10 years ago
- Guess what! ;) ☆17 · Dec 16, 2025 · Updated 3 months ago
- A versioned database inspired by Git ☆16 · Dec 16, 2017 · Updated 8 years ago
- Assignments for UC San Diego's Hadoop Platform and Application Framework class on Coursera ☆10 · Jan 27, 2016 · Updated 10 years ago
- ☆10 · Apr 3, 2019 · Updated 7 years ago
- Creating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash. ☆15 · Jun 26, 2023 · Updated 2 years ago
- POC: Spark consumer for bottledwater-pg Kafka Avro topics ☆16 · Aug 20, 2020 · Updated 5 years ago
- ☆11 · Jul 13, 2020 · Updated 5 years ago
- Single Sign Out: Authentication Service Example with JSON Web Token (JWT), Spring Boot and Redis ☆30 · Jun 28, 2017 · Updated 8 years ago
- files created in ardan labs golang training ☆12 · Nov 8, 2023 · Updated 2 years ago
- Classify images of different kitchenware items ☆11 · Apr 17, 2023 · Updated 2 years ago
- A Spark datasource for the HadoopCryptoLedger library ☆13 · Sep 29, 2025 · Updated 6 months ago
- Flask based Web application for predicting the income of a person ☆13 · Dec 23, 2018 · Updated 7 years ago
- a Ruby gem for feature selection and ranking ☆26 · Oct 25, 2023 · Updated 2 years ago
- Simple command line application to read/write message to kafka topic using protobuf ☆14 · Mar 27, 2023 · Updated 3 years ago
- Sample client programs in different languages that access data on Cassandra nodes. ☆19 · Feb 6, 2015 · Updated 11 years ago
- ☆17 · Feb 3, 2018 · Updated 8 years ago
- Projects from my Hadoop training sessions ☆16 · Feb 22, 2018 · Updated 8 years ago
- Run Airflow on Kubernetes. This repository contains scripts to 1) run a multinode Kubernetes cluster on local machine using KinD, 2) prepa… ☆17 · Apr 12, 2023 · Updated 3 years ago
- Ingress data from kafka topic into clickhouse table (JSON format) ☆24 · Apr 12, 2018 · Updated 8 years ago
- Machine Learning DevOps Engineer Nanodegree ☆11 · Jan 27, 2022 · Updated 4 years ago