dvgodoy / handysparkView external linksLinks
HandySpark - bringing pandas-like capabilities to Spark dataframes
☆197May 19, 2019Updated 6 years ago
Alternatives and similar repositories for handyspark
Users that are interested in handyspark are comparing it to the libraries listed below
Sorting:
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.☆102Aug 20, 2019Updated 6 years ago
- ☆17Jan 17, 2023Updated 3 years ago
- Demonstrates calling a Scala UDF from Python using spark-submit with an EGG and JAR☆23Mar 3, 2020Updated 5 years ago
- Jupyter magics and kernels for working with remote Spark clusters☆1,363Sep 9, 2025Updated 5 months ago
- GBM multicore scaling: h2o, xgboost and lightgbm on multicore and multi-socket systems☆20May 13, 2018Updated 7 years ago
- AWS Spot instances for ML☆39Mar 21, 2023Updated 2 years ago
- Apache (Py)Spark type annotations (stub files).☆118Aug 17, 2022Updated 3 years ago
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆811Feb 5, 2026Updated last week
- Easy to use library to bring Tensorflow on Apache Spark☆296Oct 11, 2023Updated 2 years ago
- JupyterLab Notebook for Mesosphere DC/OS☆11Aug 6, 2019Updated 6 years ago
- Code for the Attention-Gated Convolutional Neural Network for sentence classification☆11May 23, 2019Updated 6 years ago
- This repository contains the codes and figures used in the paper "Illuminating Dark Fishing Fleets in North Korea", published in Science …☆16Oct 11, 2020Updated 5 years ago
- Run FeatureTools to automate Feature Engineering distributionally on Spark.☆11Oct 11, 2018Updated 7 years ago
- Yet Another Sequence Encoder - Encode sequences to vector of vector in python !☆13May 15, 2017Updated 8 years ago
- GDG London hackathon. Prototype for Android app to get display public data on your location in an info-graphic style.☆24May 31, 2013Updated 12 years ago
- Snippets of code used in blog posts and other media.☆13Nov 11, 2025Updated 3 months ago
- Cloud Spanner Connector for Apache Spark☆17Updated this week
- Kaggle IEEE-CIS Fraud Detection 17th place solution☆12Oct 7, 2019Updated 6 years ago
- CS498HS4: Computational Advertising in Fall 2018, UIUC☆11Mar 10, 2019Updated 6 years ago
- Demo of shiny working with echarts4r☆15Feb 18, 2020Updated 5 years ago
- Project for the Advanced Multimodal Machine Learning course at CMU.☆14May 14, 2016Updated 9 years ago
- Tweet Analysis with Spark☆14Aug 28, 2017Updated 8 years ago
- A disciplined approach to neural network parameters - Reviewing the approach for setting Hyper parameters by Leslie Smith☆12Jul 18, 2018Updated 7 years ago
- Mirror of Apache Toree (Incubating)☆749Feb 7, 2026Updated last week
- ☆31Oct 14, 2019Updated 6 years ago
- A very simple way to deploy any machine learning model using Azure Functions☆29Jan 6, 2019Updated 7 years ago
- general functions for your data .pipe()-lines.☆17Nov 8, 2023Updated 2 years ago
- Spark SQL DBF Library☆16Jan 2, 2015Updated 11 years ago
- low-level helpers for Apache Spark libraries and tests☆16Dec 29, 2018Updated 7 years ago
- A low-overhead sampling profiler for PySpark, that outputs Flame Graphs☆16Dec 17, 2020Updated 5 years ago
- Featureselection methods as Spark MLlib Pipelines☆31Apr 29, 2018Updated 7 years ago
- Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark☆1,541Dec 2, 2024Updated last year
- Integrate Apache Spark with Citus distributed Postgres☆17Apr 3, 2019Updated 6 years ago
- ☆15Aug 16, 2018Updated 7 years ago
- pyspark-cassandra is a Python port of the awesome @datastax Spark Cassandra connector. Compatible w/ Spark 2.0, 2.1, 2.2, 2.3 and 2.4☆69Oct 15, 2024Updated last year
- 🎮 Notebook Enterprise Summit☆18Jun 15, 2021Updated 4 years ago
- Simple face alignment library by using face_recognition and opencv☆16Mar 13, 2019Updated 6 years ago
- [UNMAINTAINED] A starter pack for creating a lightweight responsive web app for Fast.AI PyTorch models.☆16Dec 5, 2018Updated 7 years ago
- A Python library that implements scoring utilities, analysis strategies, and visualization methods which can serve uplift modeling use-ca…☆24Mar 1, 2023Updated 2 years ago