geyungjen / jentekllc
Apache Spark Application Development -- George Jen, Jen Tek LLC
☆15Updated last year
Alternatives and similar repositories for jentekllc:
Users that are interested in jentekllc are comparing it to the libraries listed below
- How to do data science with Optimus, Spark and Python.☆19Updated 5 years ago
- Spark NLP for Streamlit☆15Updated 3 years ago
- PySpark phonetic and string matching algorithms☆39Updated 11 months ago
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- ☆19Updated 3 years ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆36Updated 7 months ago
- ☆11Updated 6 years ago
- Project template for highly effective data science workflows☆29Updated 10 months ago
- Predict whether a student will correctly answer a problem based on past performance using automated feature engineering☆32Updated 4 years ago
- code, labs and lectures for the course☆46Updated last year
- H2OAI Driverless AI Code Samples and Tutorials☆37Updated 3 months ago
- Productivity Utilities for Data Science with Python Notebooks☆6Updated 5 years ago
- Data Scientist code test☆19Updated 4 years ago
- Business Data Analysis by HiPIC of CalStateLA☆20Updated 6 years ago
- ☆14Updated 2 years ago
- A curated list of awesome Machine Learning frameworks, libraries and software.☆20Updated 9 years ago
- Openscoring application for the Docker distributed applications platform☆10Updated 4 years ago
- Extracting LinkedIn comments from any post and export it to Excel file☆23Updated 6 years ago
- ☆16Updated 4 years ago
- Instant search for and access to many datasets in Pyspark.☆34Updated 2 years ago
- ☆16Updated last year
- ☆45Updated 11 months ago
- In the customer management lifecycle, customer churn refers to a decision made by the customer about ending the business relationship. It…☆59Updated last year
- Materials for dask talk at PyData NYC☆15Updated 9 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆36Updated 5 years ago
- Article for Special Edition of Information: Machine Learning with Python☆13Updated last month
- Data validation library for PySpark 3.0.0☆34Updated 2 years ago
- Interactive notebooks containing demonstration code of the splink library☆37Updated last year
- ☆26Updated last year
- Materials for Machine Learning with H2O Open Platform at ODSC Masterclass Summit 2017☆12Updated 7 years ago