geyungjen / jentekllcLinks
Apache Spark Application Development -- George Jen, Jen Tek LLC
☆16Updated 2 years ago
Alternatives and similar repositories for jentekllc
Users that are interested in jentekllc are comparing it to the libraries listed below
Sorting:
- H2OAI Driverless AI Code Samples and Tutorials☆37Updated last year
- Spark NLP for Streamlit☆15Updated 4 years ago
- PySpark phonetic and string matching algorithms☆39Updated last year
- How to do data science with Optimus, Spark and Python.☆19Updated 6 years ago
- Large-scale Graph Mining with Spark☆39Updated 7 years ago
- ☆19Updated 4 years ago
- Instant search for and access to many datasets in Pyspark.☆34Updated 3 years ago
- Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops☆118Updated 2 years ago
- Openscoring application for the Docker distributed applications platform☆12Updated 5 years ago
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟☆53Updated 3 years ago
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing…☆46Updated 3 months ago
- Big Data Demystified meetup and blog examples☆31Updated last year
- Slides and materials for most of my talks by year☆92Updated 2 years ago
- Very basic introduction to pyspark☆15Updated 8 years ago
- ☆18Updated 4 years ago
- ☆34Updated 6 years ago
- Analysis of City Of Chicago Taxi Trip Dataset Using AWS EMR, Spark, PySpark, Zeppelin and Airbnb's Superset☆15Updated 8 years ago
- ☆33Updated 3 years ago
- Best practices for engineering ML pipelines.☆36Updated 3 years ago
- Simple demonstration of how to build a complex real time machine learning visualization tool.☆16Updated 9 years ago
- An example PySpark project with pytest☆17Updated 8 years ago
- Model management example using Polyaxon, Argo and Seldon☆23Updated 7 years ago
- Guide on creating an API for serving your ML model☆66Updated 3 years ago
- KDD Hands-On Tutorial (2018)☆29Updated 2 years ago
- Record matching and entity resolution at scale in Spark☆35Updated 2 years ago
- Notes for Data Science 350 Class☆24Updated 8 years ago
- Source code from my Master's thesis @Polytechnique Montréal. A solution to the assortment optimization problem, able to deal with large n…☆19Updated 8 years ago
- Operations Research Algorithms☆18Updated last year
- Using Kafka-Python to illustrate a ML production pipeline☆112Updated 2 years ago
- ☆103Updated 2 years ago