manuzhang / jupyterlab_sparkLinks
Spark Application UI extension for JupyterLab
☆10Updated 3 years ago
Alternatives and similar repositories for jupyterlab_spark
Users that are interested in jupyterlab_spark are comparing it to the libraries listed below
Sorting:
- Set-oriented Operations in Pandas☆24Updated 5 years ago
- ☕⛵WIP PySpark dependency management☆22Updated 6 years ago
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources☆17Updated 3 years ago
- Example python spark machine learning on NYC taxi data☆9Updated 10 years ago
- Python and Scala APIs for enhanced Spark analytics☆12Updated 8 years ago
- 💻 CLI for reporting events to Faros platform☆14Updated 3 weeks ago
- Documentation and resources for deploying JupyterHub on Hadoop☆18Updated 5 years ago
- Model management example using Polyaxon, Argo and Seldon☆23Updated 6 years ago
- Model explanation provides the ability to interpret the effect of the predictors on the composition of an individual score.☆13Updated 4 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- from zero to storm cluster for realtime classification using sklearn☆12Updated 10 years ago
- A quick start project for polyaxon☆29Updated 10 months ago
- Custom JupyterLab container for local-workstations and in-cluster Kubernetes Data Science, Machine Learning and IoT.☆12Updated 5 years ago
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase☆14Updated 9 years ago
- Dask tutorial for PyData DC 2016☆11Updated 8 years ago
- feng - feature engineering for machine-learning champions☆27Updated 8 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm☆21Updated 4 years ago
- Set of iPython and Jupyter extensions to improve user experience☆50Updated 5 years ago
- Data Catalog for Databases and Data Warehouses☆35Updated last year
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 6 years ago
- Connect DBVisualizer to Hortonwork HiveServer2☆9Updated 10 years ago
- A utility for labeling clusters of text data.☆28Updated 3 years ago
- Understand and modelize the structure behind your data with Decision Trees☆25Updated 7 years ago
- This repository is no longer maintained.☆15Updated 3 years ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to…☆29Updated 5 months ago
- My 2nd place submission (working with Kevin Goetsch) out of 28 teams at the Kaggle competition at PyCon2015.☆23Updated 10 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/)☆11Updated 10 years ago
- Python Streaming Pipelines with Beam on Flink - Demo☆14Updated 2 years ago
- In-database parallel grid-search for XGBoost on Greenplum☆15Updated 7 years ago