wsuen / pygotham2018_graphmining
Large-scale Graph Mining with Spark
☆40Updated 6 years ago
Alternatives and similar repositories for pygotham2018_graphmining:
Users that are interested in pygotham2018_graphmining are comparing it to the libraries listed below
- A Scalable Data Cleaning Library for PySpark.☆26Updated 5 years ago
- Use Kafka and Apache Spark streaming to perform click stream analytics☆76Updated 4 years ago
- PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2☆83Updated 5 years ago
- Apache Spark Application Development -- George Jen, Jen Tek LLC☆15Updated last year
- Hands-On Data Analysis with Scala, published by Packt☆20Updated 2 years ago
- Spark functions to run popular phonetic and string matching algorithms☆60Updated 2 years ago
- Mastering Spark for Data Science, published by Packt☆47Updated 2 years ago
- Scala and Spark for Big Data Analytics, published by Packt☆35Updated last year
- An example PySpark project with pytest☆17Updated 7 years ago
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u…☆26Updated 5 years ago
- Business Data Analysis by HiPIC of CalStateLA☆20Updated 6 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52Updated 8 years ago
- Repository of Notebooks taken from https://neo4j.com/graph-algorithms-book/☆26Updated 4 years ago
- Jupyter notebooks showing how to use Neo4j Graph Algorithms☆52Updated 4 years ago
- SparkER: an Entity Resolution framework for Apache Spark☆63Updated 10 months ago
- ☆33Updated 5 years ago
- Building blocks and patterns for building data prep transformations and feature engineering in Spark.☆16Updated 8 years ago
- PySpark phonetic and string matching algorithms☆39Updated 11 months ago
- Code repository for Large Scale Machine Learning with Spark by Packt☆20Updated 2 years ago
- Binding the GDELT universe in a Spark environment☆23Updated last year
- notebooks for nlp-on-spark☆13Updated 8 years ago
- Materials for Apache Arrow workshop at VLDB 2019☆42Updated 4 years ago
- Apache Spark 2x Machine Learning Cookbook, published by Packt☆29Updated 2 years ago
- MLFlow Spark Summit 2019 Presentation☆67Updated 5 years ago
- Machine Learning with Scala Quick Start Guide, published by Packt☆23Updated last year
- ☆44Updated 7 years ago
- "Building a Recommender System from Scratch" Workshop Material for PyDataDC 2018☆24Updated 6 years ago
- Simple sentiment analysis model with PySpark☆42Updated 6 years ago
- Learning Spark SQL, published by Packt☆42Updated 2 years ago
- PySpark Cookbook, published by Packt☆90Updated 2 years ago