maxmelnick / spark-graph-er
☆15Updated 6 years ago
Alternatives and similar repositories for spark-graph-er
Users who are interested in spark-graph-er are comparing it to the libraries listed below.
- PySpark phonetic and string matching algorithms☆39Updated last year
- Notebooks for the ML Link Prediction Course☆14Updated 5 years ago
- SparkER: an Entity Resolution framework for Apache Spark☆65Updated last year
- Oh you know, just a coupla, two, tree Kafka Streams in Scala☆24Updated 4 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated 2 years ago
- A sample implementation of the Spark Datasource API☆24Updated 8 years ago
- ☆31Updated 7 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Updated last year
- Repo for all my code on the articles I post on medium☆107Updated 3 years ago
- A simple Spark-powered ETL framework that just works 🍺☆181Updated 2 months ago
- PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2☆87Updated 5 years ago
- Making Machine Learning Simple and Scalable with Python, Jupyter Notebook, TensorFlow, Keras, Apache Kafka and KSQL☆97Updated 6 years ago
- Record matching and entity resolution at scale in Spark☆36Updated 2 years ago
- Apache Liminal's goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful …☆145Updated last year
- Data validation library for PySpark 3.0.0☆33Updated 3 years ago
- ☆34Updated 6 years ago
- Delta Lake Documentation☆51Updated last year
- Spark and Delta Lake Workshop☆22Updated 3 years ago
- Morpheus brings the leading graph query language, Cypher, onto the leading distributed processing platform, Spark.☆345Updated 2 weeks ago
- MLflow samples - deprecated☆22Updated 2 years ago
- ☆63Updated 6 years ago
- Read Delta tables without any Spark☆47Updated last year
- High-performance data retrieval from Neo4j with Apache Arrow 🏹☆32Updated 3 years ago
- Mastering Spark for Data Science, published by Packt☆49Updated 2 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated last week
- ☆24Updated 3 years ago
- Spark functions to run popular phonetic and string matching algorithms☆60Updated 3 years ago
- How to evaluate the Quality of your Data with Great Expectations and Spark.☆31Updated 2 years ago
- Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops☆118Updated 2 years ago
- Apache NiFi NLP Processor☆18Updated 2 years ago