geyungjen / jentekllc
Apache Spark Application Development -- George Jen, Jen Tek LLC
☆15 Updated last year
Alternatives and similar repositories for jentekllc:
Users who are interested in jentekllc are comparing it to the repositories listed below
- How to do data science with Optimus, Spark and Python. ☆19 Updated 5 years ago
- H2OAI Driverless AI Code Samples and Tutorials ☆37 Updated 3 months ago
- Large-scale Graph Mining with Spark ☆40 Updated 6 years ago
- Materials for Apache Arrow workshop at VLDB 2019 ☆42 Updated 4 years ago
- R Code + R Notebook on how to process and visualize NCAA basketball data. ☆16 Updated 6 years ago
- PySpark phonetic and string matching algorithms ☆39 Updated 11 months ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL" ☆36 Updated 6 months ago
- Business Data Analysis by HiPIC of CalStateLA ☆20 Updated 6 years ago
- Openscoring application for the Docker distributed applications platform ☆10 Updated 4 years ago
- Big Data Demystified meetup and blog examples ☆31 Updated 5 months ago
- Blog post on ETL pipelines with Airflow ☆23 Updated 4 years ago
- Best practices for engineering ML pipelines. ☆37 Updated 2 years ago
- Model management example using Polyaxon, Argo and Seldon ☆23 Updated 6 years ago
- Productivity Utilities for Data Science with Python Notebooks ☆6 Updated 4 years ago
- Tools for creating Dataproc custom images ☆32 Updated this week
- Mastering Spark for Data Science, published by Packt ☆47 Updated 2 years ago
- Work for Mastering Large Datasets with Python ☆18 Updated 2 years ago
- Capturing model drift and handling its response - Example webinar ☆107 Updated 5 years ago
- Spark NLP for Streamlit ☆15 Updated 3 years ago
- PySpark in Google Colab: A simple machine learning (Linear Regression) model ☆36 Updated 5 years ago
- Data validation library for PySpark 3.0.0 ☆34 Updated 2 years ago
- Analysis of City Of Chicago Taxi Trip Dataset Using AWS EMR, Spark, PySpark, Zeppelin and Airbnb's Superset ☆15 Updated 7 years ago
- Automated Exploratory Data Analysis. Simplifying Data Exploration ☆34 Updated 4 years ago
- Use Kafka and Apache Spark streaming to perform click stream analytics ☆76 Updated 4 years ago