geyungjen / jentekllcLinks
Apache Spark Application Development -- George Jen, Jen Tek LLC
☆15Updated 2 years ago
Alternatives and similar repositories for jentekllc
Users that are interested in jentekllc are comparing it to the libraries listed below
Sorting:
- Spark NLP for Streamlit☆15Updated 3 years ago
- How to do data science with Optimus, Spark and Python.☆19Updated 5 years ago
- H2OAI Driverless AI Code Samples and Tutorials☆37Updated 7 months ago
- Analysis of City Of Chicago Taxi Trip Dataset Using AWS EMR, Spark, PySpark, Zeppelin and Airbnb's Superset☆15Updated 7 years ago
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- ☆16Updated 2 years ago
- Model management example using Polyaxon, Argo and Seldon☆23Updated 6 years ago
- ☆19Updated 4 years ago
- Mastering Spark for Data Science, published by Packt☆47Updated 2 years ago
- ☆16Updated 4 years ago
- Productivity Utilities for Data Science with Python Notebooks☆6Updated 5 years ago
- Best practices for engineering ML pipelines.☆35Updated 2 years ago
- Openscoring application for the Docker distributed applications platform☆10Updated 4 years ago
- Tools for creating Dataproc custom images☆33Updated 2 weeks ago
- Instant search for and access to many datasets in Pyspark.☆34Updated 2 years ago
- Predict whether a student will correctly answer a problem based on past performance using automated feature engineering☆32Updated 4 years ago
- Splittable SAS (.sas7bdat) Input Format for Hadoop and Spark SQL☆91Updated last year
- R Code + R Notebook on how to process and visualize NCAA basketball data.☆16Updated 7 years ago
- Business Data Analysis by HiPIC of CalStateLA☆20Updated 6 years ago
- Tweet Analysis with Spark☆15Updated 7 years ago
- Showcase for using H2O and R for churn prediction (inspired by ZhouFang928 examples)☆58Updated 7 years ago
- Docker compose and Google Colab demo to build a CDC with Delta Lake☆15Updated 2 years ago
- Analyzing NBA data using Spark 2.1☆46Updated 8 years ago
- Getting Great Expectations setup to run on DataBricks with Spark Dataframes.☆13Updated 3 years ago
- Python bindings for Matroid API☆16Updated 4 months ago
- ☆11Updated 6 years ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆70Updated 4 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines.☆47Updated last year
- Automated Exploratory Data Analysis. Simplifying Data Exploration☆36Updated 4 years ago
- Building an API with the FastAPI framework to serve a scikit-learn model.☆18Updated 6 years ago