maxmelnick / spark-graph-er
☆15 · Updated 6 years ago
Alternatives and similar repositories for spark-graph-er
Users who are interested in spark-graph-er are comparing it to the libraries listed below.
- PySpark phonetic and string matching algorithms ☆40 · Updated last year
- MLflow samples - deprecated ☆22 · Updated 2 years ago
- notebooks for nlp-on-spark ☆13 · Updated 8 years ago
- Repo for all my code on the articles I post on medium ☆106 · Updated 3 years ago
- Notebooks for the ML Link Prediction Course ☆14 · Updated 5 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python) ☆74 · Updated 2 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark ☆16 · Updated last year
- A simple introduction to using spark ml pipelines ☆26 · Updated 7 years ago
- Instant search for and access to many datasets in Pyspark. ☆34 · Updated 3 years ago
- A repository for a PySpark Cookbook by Tomasz Drabas and Denny Lee ☆60 · Updated 7 years ago
- Repository used for Spark Trainings ☆54 · Updated 2 years ago
- ☆69 · Updated 4 years ago
- Models and Pipelines for the Spark NLP library ☆112 · Updated 4 years ago
- How to manage Slowly Changing Dimensions with Apache Hive ☆55 · Updated 6 years ago
- SparkER: an Entity Resolution framework for Apache Spark ☆65 · Updated last year
- A sample implementation of the Spark Datasource API ☆24 · Updated 8 years ago
- Mastering Spark for Data Science, published by Packt ☆49 · Updated 2 years ago
- PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2 ☆88 · Updated 6 years ago
- Spark and Delta Lake Workshop ☆22 · Updated 3 years ago
- Data validation library for PySpark 3.0.0 ☆33 · Updated 3 years ago
- ☆31 · Updated 7 years ago
- Making Machine Learning Simple and Scalable with Python, Jupyter Notebook, TensorFlow, Keras, Apache Kafka and KSQL ☆97 · Updated 6 years ago
- Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops ☆118 · Updated 2 years ago
- ☆16 · Updated 2 years ago
- Takes a kafka stream into spark, apply transformations and sink into Druid. Everything Dockerised. ☆30 · Updated 2 years ago
- Fuzzy matching function in spark (https://spark-packages.org/package/itspawanbhardwaj/spark-fuzzy-matching) ☆24 · Updated 6 years ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an… ☆62 · Updated last year
- Repository for medium article ☆21 · Updated last year
- Demonstration code for MLeap, both Jupyter notebooks and projects ☆24 · Updated 6 years ago
- MLflow App Library ☆77 · Updated 7 years ago