geyungjen / jentekllc
Apache Spark Application Development -- George Jen, Jen Tek LLC
☆16 · Updated 2 years ago
Alternatives and similar repositories for jentekllc
Users who are interested in jentekllc are comparing it to the libraries listed below:
- H2OAI Driverless AI Code Samples and Tutorials ☆37 · Updated 11 months ago
- Instant search for and access to many datasets in Pyspark. ☆34 · Updated 3 years ago
- Spark and Python (PySpark) Examples ☆39 · Updated 4 years ago
- Model management example using Polyaxon, Argo and Seldon ☆23 · Updated 7 years ago
- Analyzing NBA data using Spark 2.1 ☆46 · Updated 8 years ago
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟 ☆53 · Updated 3 years ago
- Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops ☆118 · Updated 2 years ago
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing… ☆46 · Updated 2 months ago
- Blog post on ETL pipelines with Airflow ☆24 · Updated last month
- Code and notebooks for a talk given at PyBay, 2018-08-19 ☆49 · Updated 4 years ago
- Openscoring application for the Docker distributed applications platform ☆11 · Updated 4 years ago
- PySpark phonetic and string matching algorithms ☆39 · Updated last year
- ☆19 · Updated 4 years ago
- Labs and data files for a full-day Spark workshop ☆24 · Updated 4 months ago
- ☆44 · Updated 7 years ago
- ☆12 · Updated 5 years ago
- Project template for highly effective data science workflows ☆29 · Updated last year
- Large-scale Graph Mining with Spark ☆39 · Updated 7 years ago
- An experiment on explicit vs implicit feedback recommenders ☆25 · Updated 7 years ago
- Use Kafka and Apache Spark streaming to perform click stream analytics ☆76 · Updated 5 years ago
- Notes for Data Science 350 Class ☆24 · Updated 8 years ago
- ☆15 · Updated 7 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model ☆38 · Updated 6 years ago
- Code examples for the Introduction to Kubeflow course ☆14 · Updated 4 years ago
- Analysis of City Of Chicago Taxi Trip Dataset Using AWS EMR, Spark, PySpark, Zeppelin and Airbnb's Superset ☆15 · Updated 8 years ago
- Best practices for engineering ML pipelines. ☆36 · Updated 3 years ago
- PySpark Machine Learning Examples ☆45 · Updated 7 years ago
- Know your ML Score based on Sculley's paper ☆34 · Updated 6 years ago
- An example PySpark project with pytest ☆17 · Updated 8 years ago
- Material and slides for Boston NLP meetup May 23rd 2016 ☆17 · Updated 9 years ago