geyungjen / jentekllcLinks
Apache Spark Application Development -- George Jen, Jen Tek LLC
☆15Updated 2 years ago
Alternatives and similar repositories for jentekllc
Users that are interested in jentekllc are comparing it to the libraries listed below
Sorting:
- H2OAI Driverless AI Code Samples and Tutorials☆37Updated 10 months ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines.☆47Updated last year
- Data Scientist code test☆19Updated 5 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆38Updated 6 years ago
- Openscoring application for the Docker distributed applications platform☆10Updated 4 years ago
- ☆19Updated 4 years ago
- PySpark phonetic and string matching algorithms☆39Updated last year
- ☆16Updated 2 years ago
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟☆53Updated 3 years ago
- Best practices for engineering ML pipelines.☆35Updated 3 years ago
- Predict whether a student will correctly answer a problem based on past performance using automated feature engineering☆32Updated 5 years ago
- Apache Toree quickstart tutorial☆29Updated 9 years ago
- Model management example using Polyaxon, Argo and Seldon☆23Updated 6 years ago
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing…☆46Updated last month
- scaffold of Apache Airflow executing Docker containers☆86Updated 2 years ago
- This repository contains my ML scripts in R☆33Updated 8 years ago
- Analysis of City Of Chicago Taxi Trip Dataset Using AWS EMR, Spark, PySpark, Zeppelin and Airbnb's Superset☆15Updated 8 years ago
- An experiment on explicit vs implicit feedback recommenders☆25Updated 7 years ago
- Resources for the Data Mining for Bussiness and Governance course.☆54Updated 4 years ago
- Big Data Demystified meetup and blog examples☆31Updated last year
- Analyzing NBA data using Spark 2.1☆46Updated 8 years ago
- Splittable SAS (.sas7bdat) Input Format for Hadoop and Spark SQL☆92Updated last year
- Instant search for and access to many datasets in Pyspark.☆34Updated 2 years ago
- Spark and Python (PySpark) Examples☆39Updated 4 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming☆55Updated 6 years ago
- Very basic introduction to pyspark☆15Updated 8 years ago
- Projects developed by Domino's R&D team☆77Updated 3 years ago
- An in depth tutorial on sklearn's Pipeline and FeatureUnion classes.☆16Updated 8 years ago
- How to do data science with Optimus, Spark and Python.☆19Updated 6 years ago
- Material and slides for Boston NLP meetup May 23rd 2016☆17Updated 9 years ago