A code-based tutorial for production-level data streaming with PySpark (plus Optimus for data cleaning), Confluent Kafka, and Apache Drill, using Docker and Cassandra (a NoSQL database) for storage. This allows for fast feature engineering and data cleaning.
☆28 Jul 8, 2019 Updated 6 years ago
Alternatives and similar repositories for PySpark-Confluent-Kafka-Apache-Drill-
Users that are interested in PySpark-Confluent-Kafka-Apache-Drill- are comparing it to the libraries listed below.
- ☆13 Sep 3, 2020 Updated 5 years ago
- Stripe Payment Gateway integration in Django ☆10 May 24, 2021 Updated 4 years ago
- Clinical NLP Analysis with Elasticsearch and Kibana ☆35 Feb 28, 2019 Updated 7 years ago
- A simple POC app on Django framework ☆11 Feb 14, 2019 Updated 7 years ago
- Collection of Databricks and Jupyter Notebooks ☆22 Feb 9, 2026 Updated 3 months ago
- Version 1 of Habaneras de Lino is an ecommerce website. This repo contains the backend API of the website using Django and Django Rest Fram… ☆13 Dec 16, 2022 Updated 3 years ago
- MongoDB Change Streams and Kafka Example Application ☆14 Nov 16, 2017 Updated 8 years ago
- Projects from Udacity Data Streaming Nanodegree ☆15 Aug 14, 2023 Updated 2 years ago
- Kaggle Human Protein Atlas Image Classification 73rd solution ☆19 Jan 14, 2019 Updated 7 years ago
- 🍴A responsive restaurant theme built with Bootstrap 4 ☆14 Dec 17, 2018 Updated 7 years ago
- Kaggle solutions ☆17 Nov 22, 2022 Updated 3 years ago
- This repo contains a data science project to identify patients at high-risk of Alzheimer's disease. ☆12 Feb 20, 2021 Updated 5 years ago
- Python library for deploying models built using Python to Alteryx Promote. ☆15 Dec 10, 2021 Updated 4 years ago
- Article for Special Edition of Information: Machine Learning with Python ☆14 Jan 8, 2025 Updated last year
- mysql-workbench ☆15 Nov 11, 2018 Updated 7 years ago
- A repository to store articles, links, and other resources the club finds helpful ☆10 Apr 29, 2019 Updated 7 years ago
- Automate claim approval in personal insurance sector. ☆20 Apr 21, 2016 Updated 10 years ago
- ☆16 Jun 18, 2025 Updated 11 months ago
- This repo is for building Docker containers for RStudio, PostgreSQL, Hadoop, Spark, etc. ☆22 May 12, 2021 Updated 5 years ago
- Example project using Cucumber-JVM and Scala steps ☆17 Apr 13, 2026 Updated last month
- AWS Big Data Certification ☆25 Mar 26, 2026 Updated last month
- DevOps for AI project using Azure Databricks, Azure DevOps and Azure Machine Learning Service ☆15 Jul 21, 2021 Updated 4 years ago
- Recency, Frequency, and Monetary are three behavioral attributes and are quite simple, in that they can be easily computed for any databa… ☆15 Nov 20, 2025 Updated 6 months ago
- LSTM text generation by word. Used to generate multiple sentence suggestions based on the input words or a sentence ☆27 Nov 10, 2020 Updated 5 years ago
- 🍀 Opinionated LATEX-based Resume Template for Data Science Role 🍀 ☆12 May 23, 2019 Updated 6 years ago
- Classifying malignant and benign tumors using Neural Networks 🔬 ☆18 Jun 4, 2021 Updated 4 years ago
- Final Project for IoT: Big Data Processing and Analytics class. Analyzing U.S. nationwide temperature from IoT sensors in real-time ☆71 Nov 21, 2016 Updated 9 years ago
- Silver and Bronze medal solutions to the Kaggle challenges on Google Landmark Dataset ☆18 Jun 9, 2019 Updated 6 years ago
- R package 2013 google trend ☆15 Jan 5, 2015 Updated 11 years ago
- QuasiModo: Assessing viral genomic analysis methods on HCMV strain mixture ☆12 Sep 22, 2022 Updated 3 years ago
- A collection of my NLP projects ☆19 Aug 26, 2019 Updated 6 years ago
- Tutorial for text classification with BERT, using HuggingFace's transformers. ☆13 Jan 15, 2020 Updated 6 years ago
- Slideshow template for Voilà based on RevealJS ☆16 Nov 17, 2021 Updated 4 years ago
- Set up an automated data science environment using Docker ☆14 Oct 2, 2018 Updated 7 years ago
- ☆23 Jun 11, 2019 Updated 6 years ago
- R-Machine-Learning-Projects ☆30 Jan 30, 2023 Updated 3 years ago
- PySpark, Databricks, h2o, MLlib ☆20 Aug 25, 2016 Updated 9 years ago
- MeatPy ☆31 May 12, 2026 Updated last week
- Cognitive Compute aims to present some micro service capabilities as front end to Watson Conversation, Discovery and other bluemix servic… ☆11 Dec 7, 2018 Updated 7 years ago