daddydrac/PySpark-Confluent-Kafka-Apache-Drill-

daddydrac / PySpark-Confluent-Kafka-Apache-Drill-

A code-based tutorial for production-level data streaming with PySpark (plus Optimus for data cleaning), Confluent Kafka, and Apache Drill, using Docker, with Cassandra (a NoSQL database) for storage. This setup allows for fast feature engineering and data cleaning.

☆28

Alternatives and similar repositories for PySpark-Confluent-Kafka-Apache-Drill-

Users that are interested in PySpark-Confluent-Kafka-Apache-Drill- are comparing it to the libraries listed below.

jukkakansanaho / udacity-dend-project-3
Udacity Data Engineer Nano Degree - Project-3 (Data Warehouse)
☆22 · Jun 20, 2019 · Updated 7 years ago
pmservice / product-line-prediction
☆13 · Sep 3, 2020 · Updated 5 years ago
indiacloudtv / structuredstreamingkafkapyspark
Apache Spark Structured Streaming with Kafka using Python (PySpark)
☆40 · May 16, 2019 · Updated 7 years ago
Gymnott1 / VibeEx-CLI
VibEx (vx) is a developer-friendly CLI tool that streamlines the process of working with AI coding assistants. It helps developers prepar…
☆29 · Jul 10, 2026 · Updated last week
syedhassaanahmed / databricks-notebooks
Collection of Databricks and Jupyter Notebooks
☆22 · Feb 9, 2026 · Updated 5 months ago
omarsar / clinical_nlp_elastic
Clinical NLP Analysis with Elasticsearch and Kibana
☆35 · Feb 28, 2019 · Updated 7 years ago
aaronstone007 / Udacity-Data-Streaming
Projects from Udacity Data Streaming Nanodegree
☆15 · Aug 14, 2023 · Updated 2 years ago
DC-777 / ML-construction-cost-prediction
In this work, we compared the predictive capabilities of six different machine learning algorithms - linear regression, random forest, ex…
☆17 · Sep 21, 2020 · Updated 5 years ago
alteryx / promote-python
Python library for deploying models built using Python to Alteryx Promote.
☆15 · Dec 10, 2021 · Updated 4 years ago
pmservice / wml-sample-models
This repository is archived. Please navigate to: https://github.com/IBM/watson-machine-learning-samples
☆39 · Sep 3, 2020 · Updated 5 years ago
chuktuk / Alzheimers_Disease_Analysis
This repo contains a data science project to identify patients at high-risk of Alzheimer's disease.
☆12 · Feb 20, 2021 · Updated 5 years ago
Dhruv9051 / file-scavenger
File Scavenger is a powerful VS Code extension designed to help developers identify and manage unused files in their projects. With an in…
☆21 · Feb 1, 2025 · Updated last year
ParticipaPY / civic-crowdanalytics
Analytics tool that applies Natural Language Processing (NLP) and Machine Learning (ML), such as concept extraction, idea classification,…
☆10 · Dec 7, 2022 · Updated 3 years ago
mljar / automl_comparison
Comparison of automatic machine learning libraries
☆29 · Dec 7, 2017 · Updated 8 years ago
irdanish11 / Sentence-Prediction-using-LSTMs_aka-Language-Modeling
LSTM text generation by word. Used to generate multiple sentence suggestions based on the input words or a sentence
☆27 · Nov 10, 2020 · Updated 5 years ago
IBM / mean-app
WARNING: This repository is no longer maintained ⚠️ This repository will not be updated.
☆12 · May 31, 2022 · Updated 4 years ago
vinmuk / NFL-predict-yards
7th place code at NFL Big Data Bowl
☆12 · Jan 8, 2020 · Updated 6 years ago
dalpozz / AMLFD
Adaptive Machine Learning for Credit Card Fraud Detection
☆37 · Sep 4, 2017 · Updated 8 years ago
ggangliu / uC-OS-III
Clean official source code for uC/OS-III
☆14 · Nov 20, 2017 · Updated 8 years ago
paiml / awsbigdata
AWS Big Data Certification
☆25 · Mar 26, 2026 · Updated 3 months ago
rebremer / devopsai_databricks
DevOps for AI project using Azure Databricks, Azure DevOps and Azure Machine Learning Service
☆15 · Jul 21, 2021 · Updated 5 years ago
saritmaitra / Segmentation-Clustering
Recency, Frequency, and Monetary are three behavioral attributes and are quite simple, in that they can be easily computed for any databa…
☆15 · Jun 18, 2026 · Updated last month
akshaybahadur21 / Breast-Cancer-Neural-Networks
Classifying malignant and benign tumors using Neural Networks 🔬
☆18 · Jun 4, 2021 · Updated 5 years ago
LiXiling / ml_insurancePred
Insurance Claim Prediction using Machine Learning - Udacity Nanodegree Capstone Project
☆16 · Nov 1, 2016 · Updated 9 years ago
bolicd / practicalcqrs
Clean code CQRS/ES with projections example project
☆15 · Nov 4, 2021 · Updated 4 years ago
mayukh18 / Google-Landmark-Recognition-Retrieval-2019
Silver and Bronze medal solutions to the Kaggle challenges on Google Landmark Dataset
☆18 · Jun 9, 2019 · Updated 7 years ago
davidctj / react-plotlyjs-ts
A react-typescript component for Plotly.JS graphs.
☆15 · Feb 29, 2020 · Updated 6 years ago
bhattbhavesh91 / decision_tree_grid_search
Implementation of Grid Search to find better hyper-parameters for a decision tree to reduce overfitting.
☆12 · May 29, 2021 · Updated 5 years ago
Shivansh-Khunger / nitro
create-nitro: A powerful scaffolding tool for quickly setting up Node.js APIs with industry-standard templates. Features include database…
☆22 · Jun 3, 2024 · Updated 2 years ago
nikhilno1 / nlp_projects
A collection of my NLP projects
☆19 · Aug 26, 2019 · Updated 6 years ago
databricks / drunken-data-quality-1
Spark package for checking data quality
☆26 · Mar 30, 2023 · Updated 3 years ago
icmpnorequest / Pytorch_BERT_Text_Classification
Tutorial for text classification with BERT, using HuggingFace's transformers.
☆13 · Jan 15, 2020 · Updated 6 years ago
voila-dashboards / voila-reveal
Slideshow template for Voilà based on RevealJS
☆16 · Nov 17, 2021 · Updated 4 years ago
bobbywlindsey / docker-data-science
Set up an automated data science environment using Docker
☆14 · Oct 2, 2018 · Updated 7 years ago
muukii / Presenter
Screen transition with safe and clean code.
☆15 · Oct 25, 2016 · Updated 9 years ago
abhishekkrthakur / imet-collection
☆23 · Jun 11, 2019 · Updated 7 years ago
MiyainNYC / Distributed-Machine-Learning
PySpark, Databricks, h2o, MLlib
☆20 · Aug 25, 2016 · Updated 9 years ago
rebryk / kaggle
Kaggle solutions
☆17 · Nov 22, 2022 · Updated 3 years ago
vtigranv / Front-end-Guidelines
How to write super-clean front-end code
☆20 · Dec 13, 2017 · Updated 8 years ago