maxmelnick / spark-graph-er
☆15 Updated 6 years ago
Alternatives and similar repositories for spark-graph-er
Users who are interested in spark-graph-er are comparing it to the libraries listed below.
- PySpark phonetic and string matching algorithms ☆39 Updated last year
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark ☆16 Updated last year
- Data validation library for PySpark 3.0.0 ☆33 Updated 3 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python) ☆74 Updated 2 years ago
- Instant search for and access to many datasets in Pyspark. ☆34 Updated 3 years ago
- Repository used for Spark Trainings ☆54 Updated 2 years ago
- ☆31 Updated 7 years ago
- Repository for medium article ☆21 Updated last year
- Record matching and entity resolution at scale in Spark ☆35 Updated 2 years ago
- Repo for all my code on the articles I post on medium ☆107 Updated 3 years ago
- HandySpark - bringing pandas-like capabilities to Spark dataframes ☆195 Updated 6 years ago
- Making Machine Learning Simple and Scalable with Python, Jupyter Notebook, TensorFlow, Keras, Apache Kafka and KSQL ☆97 Updated 6 years ago
- ☆16 Updated 2 years ago
- A repository for a PySpark Cookbook by Tomasz Drabas and Denny Lee ☆60 Updated 7 years ago
- ☆34 Updated 6 years ago
- Spark functions to run popular phonetic and string matching algorithms ☆60 Updated 3 years ago
- MLflow samples - deprecated ☆22 Updated 2 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming ☆55 Updated 6 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform ☆47 Updated 10 months ago
- Presentations and other resources. ☆36 Updated 5 years ago
- Oh you know, just a coupla, two, tree Kafka Streams in Scala ☆24 Updated 4 years ago
- How to evaluate the Quality of your Data with Great Expectations and Spark. ☆31 Updated 2 years ago
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u… ☆27 Updated 6 years ago
- Notebooks for the ML Link Prediction Course ☆14 Updated 5 years ago
- Learning PySpark video series ☆11 Updated 7 years ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples ☆71 Updated 5 years ago
- Delta Lake examples ☆233 Updated last year
- Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops ☆118 Updated 2 years ago
- MLFlow Spark Summit 2019 Presentation ☆67 Updated 6 years ago
- Repository of sample Databricks notebooks ☆272 Updated last year