A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill using Docker and Cassandra (NoSQL DB) for storage; This allows for for fast feature engineering and data cleaning.
☆28Jul 8, 2019Updated 6 years ago
Alternatives and similar repositories for PySpark-Confluent-Kafka-Apache-Drill-
Users that are interested in PySpark-Confluent-Kafka-Apache-Drill- are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Machine Learning to predict a customer's next purchase - Fulfills many use-cases from recommendation systems to loyalty programs.☆12Aug 30, 2021Updated 4 years ago
- ☆13Sep 3, 2020Updated 5 years ago
- Machine Learning for Industrial IoT Applications: Predict how long a part will work before performance degrades Perect for 5G cell phone…☆39Aug 30, 2021Updated 4 years ago
- Apche Spark Structured Streaming with Kafka using Python(PySpark)☆40May 16, 2019Updated 6 years ago
- Product Recommender System for Retail Dataset☆14Sep 22, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Deploy a Flask-based microservice (along with Postgres and React) to a Kubernetes cluster☆18May 6, 2021Updated 4 years ago
- Clinical NLP Analysis with Elasticsearch and Kibana☆35Feb 28, 2019Updated 7 years ago
- Collection of Databricks and Jupyter Notebooks☆22Feb 9, 2026Updated 2 months ago
- Leverage in-memory data storage to make your Python apps snappy.☆22Updated this week
- Kafka-connect telegram connector☆16Nov 21, 2025Updated 5 months ago
- In this work, we compared the predictive capabilities of six different machine learning algorithms - linear regression, random forest, ex…☆16Sep 21, 2020Updated 5 years ago
- Projects from Udacity Data Streaming Nanodegree☆15Aug 14, 2023Updated 2 years ago
- Kaggle Human Protein Atlas Image Classification 73th solution☆19Jan 14, 2019Updated 7 years ago
- Studying usage of Xray (JIRA plugin)☆14Oct 5, 2016Updated 9 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Kaggle solutions☆17Nov 22, 2022Updated 3 years ago
- This repository is archived. Please navigate to: https://github.com/IBM/watson-machine-learning-samples☆39Sep 3, 2020Updated 5 years ago
- This repo contains a data science project to identify patients at high-risk of Alzheimer's disease.☆12Feb 20, 2021Updated 5 years ago
- Slides from my talk on spaCy IRL, regarding sparse attention.☆12Jul 9, 2019Updated 6 years ago
- ☆37Jul 8, 2019Updated 6 years ago
- Agent to integrate CucumberJS with ReportPortal.☆14Apr 11, 2026Updated last week
- A repository to store articles, links, and other resources the club finds helpful☆10Apr 29, 2019Updated 6 years ago
- Automate claim approval in personal insurance sector.☆20Apr 21, 2016Updated 10 years ago
- A yeoman generator to add protractor to your project☆24Jan 15, 2016Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Apache maven GUI☆16Oct 13, 2020Updated 5 years ago
- File Scavenger is a powerful VS Code extension designed to help developers identify and manage unused files in their projects. With an in…☆21Feb 1, 2025Updated last year
- 7th place code at NFL Big Data Bowl☆12Jan 8, 2020Updated 6 years ago
- Python client to integrate Cleanlab Codex with your AI Agent☆19Nov 19, 2025Updated 5 months ago
- DevOps for AI project using Azure Databricks, Azure DevOps and Azure Machine Learning Service☆16Jul 21, 2021Updated 4 years ago
- This container is no longer supported, and has been deprecated in favor of: https://github.com/joehoeller/NVIDIA-GPU-Tensor-Core-Accelera…☆45Aug 30, 2021Updated 4 years ago
- Explains how to develop Ionic application with Apollo GraphQL client☆18Nov 25, 2018Updated 7 years ago
- LSTM text generation by word. Used to generate multiple sentence suggestions based on the input words or a sentence☆27Nov 10, 2020Updated 5 years ago
- Study Guide for AWS Big Data Speciality Certification☆19May 27, 2019Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Final Project for IoT: Big Data Processing and Analytics class. Analyzing U.S nationwide temperature from IoT sensors in real-time☆71Nov 21, 2016Updated 9 years ago
- Silver and Bronze medal solutions to the Kaggle challenges on Google Landmark Dataset☆18Jun 9, 2019Updated 6 years ago
- A collection of my NLP projects☆19Aug 26, 2019Updated 6 years ago
- Rust library to work with global positions and vectors☆16Mar 12, 2026Updated last month
- This is the behavior scorecard, which includes three modules, including data processing, establishment of score card and effect evaluatio…☆19May 21, 2019Updated 6 years ago
- ☆10Feb 14, 2019Updated 7 years ago
- Code developed live during Devoxx 2016: video is at https://www.youtube.com/watch?v=dzdjP3CPOCs and slides are at http://www.slideshare.n…☆25Apr 14, 2022Updated 4 years ago