geyungjen / jentekllcLinks
Apache Spark Application Development -- George Jen, Jen Tek LLC
☆16Updated 2 years ago
Alternatives and similar repositories for jentekllc
Users that are interested in jentekllc are comparing it to the libraries listed below
Sorting:
- Large-scale Graph Mining with Spark☆39Updated 7 years ago
- H2OAI Driverless AI Code Samples and Tutorials☆37Updated last year
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟☆53Updated 3 years ago
- Instant search for and access to many datasets in Pyspark.☆34Updated 3 years ago
- Spark NLP for Streamlit☆15Updated 4 years ago
- Openscoring application for the Docker distributed applications platform☆12Updated 5 years ago
- ☆19Updated 4 years ago
- Source code from my Master's thesis @Polytechnique Montréal. A solution to the assortment optimization problem, able to deal with large n…☆19Updated 8 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆38Updated 6 years ago
- Notes for Data Science 350 Class☆24Updated 8 years ago
- Scalable Data Science, course sets in big data Using Apache Spark over databricks and their mathematical, statistical and computational f…☆168Updated 3 months ago
- Operations Research Algorithms☆18Updated last year
- Use Kafka and Apache Spark streaming to perform click stream analytics☆76Updated 5 years ago
- Watson OpenScale tutorials including sample models, notebooks and applications☆22Updated 3 years ago
- Apache Toree quickstart tutorial☆29Updated 9 years ago
- ☆16Updated 2 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52Updated 9 years ago
- Labs and data files for a full-day Spark workshop☆24Updated 6 months ago
- Sample data science projects (machine learning, optimization, business intelligence)☆28Updated 7 years ago
- PySpark phonetic and string matching algorithms☆39Updated last year
- An example PySpark project with pytest☆17Updated 8 years ago
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing…☆46Updated 2 weeks ago
- Analyzing NBA data using Spark 2.1☆46Updated 8 years ago
- KDD Hands-On Tutorial (2018)☆29Updated 3 years ago
- Project template for highly effective data science workflows☆29Updated 3 weeks ago
- Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops☆118Updated 2 years ago
- Tweet Analysis with Spark☆15Updated 8 years ago
- SCARFF (SCAlable Real-time Frauds Finder) is a framework which enables credit card fraud detection.☆19Updated 8 years ago
- Best practices for engineering ML pipelines.☆36Updated 3 years ago
- How to do data science with Optimus, Spark and Python.☆19Updated 6 years ago