A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill using Docker and Cassandra (NoSQL DB) for storage; This allows for for fast feature engineering and data cleaning.
☆28Jul 8, 2019Updated 6 years ago
Alternatives and similar repositories for PySpark-Confluent-Kafka-Apache-Drill-
Users that are interested in PySpark-Confluent-Kafka-Apache-Drill- are comparing it to the libraries listed below
Sorting:
- ☆13Sep 3, 2020Updated 5 years ago
- ☆19Oct 10, 2020Updated 5 years ago
- Code developed live during Devoxx 2016: video is at https://www.youtube.com/watch?v=dzdjP3CPOCs and slides are at http://www.slideshare.n…☆25Apr 14, 2022Updated 3 years ago
- This repository is archived. Please navigate to: https://github.com/IBM/watson-machine-learning-samples☆39Sep 3, 2020Updated 5 years ago
- Project Repo for GSoC 2019 at CERN☆11Jan 4, 2023Updated 3 years ago
- Our project aims to create a more reliable and accurate way for startups to place a valuation on themselves, by using a reliable ML model…☆11Jan 27, 2023Updated 3 years ago
- ☆10Jul 5, 2023Updated 2 years ago
- My Tensorflow Notebook. In this notebooks I have implemented various kind of model optimisation techniques.☆10Dec 4, 2021Updated 4 years ago
- Examples for eclairjs-node and eclairjs-nashorn☆12Jan 24, 2017Updated 9 years ago
- Article for Special Edition of Information: Machine Learning with Python☆14Jan 8, 2025Updated last year
- Curso de Machine Learning☆11Apr 22, 2018Updated 7 years ago
- WILL™ SDK for ink supports a variety of input technologies and generates the highest quality, most attractive digital ink outputs via the…☆13Jul 1, 2024Updated last year
- ☆10Dec 9, 2018Updated 7 years ago
- Utilizing AutoXGB for Credit Card Financial Fraud Detection☆12Dec 1, 2021Updated 4 years ago
- ☆11Jul 26, 2020Updated 5 years ago
- Scalable Bayes via Barycenter in Wasserstein Space☆10Sep 7, 2017Updated 8 years ago
- Built the chatbot using rule-based approach.☆11Feb 27, 2018Updated 8 years ago
- Openscoring application for the Docker distributed applications platform☆12Nov 8, 2020Updated 5 years ago
- Link to the dashboard☆12Apr 21, 2023Updated 2 years ago
- ☆11Feb 16, 2021Updated 5 years ago
- Module for working with linear algebra in Elixir.☆15Jan 3, 2017Updated 9 years ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package☆10Sep 9, 2015Updated 10 years ago
- KumuluzEE REST extension for implementation of common, advanced and flexible REST API functionalities and patterns as microservices.☆10Jan 12, 2026Updated last month
- ☆12Mar 17, 2023Updated 2 years ago
- Custom Search Experience☆11Aug 4, 2017Updated 8 years ago
- Code pattern to show notebook generation feature of AutoAI☆11Jun 7, 2021Updated 4 years ago
- Implementations of transformer models in pytorch☆14Jun 2, 2020Updated 5 years ago
- A step-by-step guide to data structures and algorithms☆10Jan 30, 2023Updated 3 years ago
- Emusify is a real-time mood-based music recommendation system that runs in the background and plays music according to a user's mood.☆12Jun 4, 2023Updated 2 years ago
- ☆17Mar 11, 2019Updated 6 years ago
- This publication on medium solves DL datasets with neural nets (Complete analysis of data sets)☆10Mar 24, 2023Updated 2 years ago
- An example of a medical app built with Flutter for the classification of the arterial blood pressure.☆10Jan 28, 2019Updated 7 years ago
- ☆12Jul 27, 2015Updated 10 years ago
- ☆11May 8, 2016Updated 9 years ago
- Stencila for Python☆17Aug 3, 2018Updated 7 years ago
- Neo4j, MySQL, Spring boot example☆12May 19, 2017Updated 8 years ago
- ☆10Feb 14, 2019Updated 7 years ago
- Lightweight Tap-Tempo in jQuery☆10Mar 15, 2017Updated 8 years ago
- A UAV-specific Python image processing library built upon xarray and geopandas.☆15Dec 17, 2024Updated last year