A lightweight data processing framework for Apache Spark
☆16Dec 8, 2022Updated 3 years ago
Alternatives and similar repositories for sparklanes
Users that are interested in sparklanes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Recency, Frequency, and Monetary are three behavioral attributes and are quite simple, in that they can be easily computed for any databa…☆15Nov 20, 2025Updated 5 months ago
- ☆25Apr 24, 2019Updated 7 years ago
- In this brief post I’d like to share my experience with the Kaggle Python Docker image, which simplifies the Data Scientist’s life ….☆10Jan 8, 2018Updated 8 years ago
- Scripts used frequently by me and maybe usefull for others☆10Jan 14, 2026Updated 4 months ago
- ☆14Apr 25, 2017Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- U-Net for BDD100K Dataset☆13Jan 1, 2019Updated 7 years ago
- My collection of puppet modules - mostly licensed under GPLv3☆19May 11, 2017Updated 9 years ago
- A selection of test cases used to test accessibility and Section 508 compliance of mobile applications☆12Apr 1, 2015Updated 11 years ago
- GAN-enhanced Conditional Echocardiogram Generation☆14Mar 24, 2023Updated 3 years ago
- List of Papers read for Wireless and Ubiquitous Computing☆11Apr 23, 2019Updated 7 years ago
- ☆13Mar 18, 2021Updated 5 years ago
- Colab, MLflow and papermill are individually great. Together they form a dream team.☆10Jun 9, 2020Updated 5 years ago
- A pure python mock of pyspark's RDD☆27Jun 22, 2018Updated 7 years ago
- Extract data from a variety of eGift card emails, and from swiped physical gift cards.☆11Feb 1, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- MQTT client GUI to observe messages exchanged on MQTT server.☆13Jul 10, 2020Updated 5 years ago
- Repo for Service worker Workshop at Google I/O Extended Bangkok☆13Jun 25, 2016Updated 9 years ago
- ☆16Apr 21, 2025Updated last year
- word2vec with a context based on sentences.☆15Jan 30, 2017Updated 9 years ago
- This is a pipeline of an ETL application in GCP with open airport code data, which you can find here: https://datahub.io/core/airport-cod…☆15Nov 15, 2021Updated 4 years ago
- This repo contains the code demonstrated in the Analytics Vidhya article about PyWebIO usage and the ML model prediction code.☆11Apr 22, 2021Updated 5 years ago
- ☆12Jan 23, 2023Updated 3 years ago
- Agent Development Kit (ADK)☆22Sep 24, 2025Updated 7 months ago
- Materials for demonstrating video model deployment☆17Jun 14, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- TestKit - Embedded Kafka, Zookeeper, Schema Registry☆10Dec 28, 2017Updated 8 years ago
- Get Twitter trends with twitter4j, stream it to a Kafka topic, save it to MongoDB and visualize in Google Maps☆13Sep 30, 2021Updated 4 years ago
- Mined data from Twitter and classify the users based on their locations and preferences to target them through marketing campaigns.☆13Dec 27, 2020Updated 5 years ago
- RFM (Recency, Frequency, Monetary) Analysis is a marketing technique used to determine quantitatively which customers are the best ones b…☆11May 15, 2018Updated 8 years ago
- Analytics on Apache Projects for Diversity☆18Jun 18, 2019Updated 6 years ago
- Code to reproduce experiments from "A Statistical Approach to Assessing Neural Network Robustness"☆12Feb 11, 2019Updated 7 years ago
- Random jupyter notebooks on data analysis and machine learning☆16Nov 23, 2018Updated 7 years ago
- A simple web-scraping script to find all relevant extracts from Earnings Call Transcripts of S&P 500 companies in a given sector containi…☆12Feb 13, 2017Updated 9 years ago
- ☆15May 12, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Terraform script for launching multiple EMR clusters for training purposes.☆16Oct 30, 2025Updated 6 months ago
- ☆13Jan 24, 2023Updated 3 years ago
- ☆12Aug 22, 2018Updated 7 years ago
- pytest plugin to run the tests with support of pyspark☆88May 21, 2025Updated 11 months ago
- Various data stream/batch process demo with Apache Scala Spark 🚀☆12Feb 28, 2020Updated 6 years ago
- Machine Learning with TensorFlow Extended (TFX) Pipelines☆13Nov 9, 2023Updated 2 years ago
- Final year project to create a robot for agriculture using ROS support☆11Mar 20, 2018Updated 8 years ago