maxmelnick / spark-graph-er
☆15 Updated 5 years ago
Alternatives and similar repositories for spark-graph-er:
Users who are interested in spark-graph-er compare it to the libraries listed below.
- PySpark phonetic and string matching algorithms ☆39 Updated last year
- Record matching and entity resolution at scale in Spark ☆34 Updated last year
- Notebooks for the ML Link Prediction Course ☆14 Updated 4 years ago
- ☆16 Updated last year
- A pyspark lib to validate data quality ☆18 Updated 2 years ago
- Spark NLP for Streamlit ☆15 Updated 3 years ago
- notebooks for nlp-on-spark ☆13 Updated 8 years ago
- real-time data + ML pipeline ☆54 Updated last month
- ☆30 Updated 2 years ago
- Code that was used as an example during the Data+AI Summit 2020 ☆15 Updated 4 years ago
- Distributed Bayesian Entity Resolution in Apache Spark ☆57 Updated 3 years ago
- How to evaluate the Quality of your Data with Great Expectations and Spark. ☆29 Updated last year
- SparkER: an Entity Resolution framework for Apache Spark ☆63 Updated 11 months ago
- Data validation library for PySpark 3.0.0 ☆33 Updated 2 years ago
- A simple introduction to using spark ml pipelines ☆26 Updated 6 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python) ☆74 Updated last year
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an… ☆61 Updated 6 months ago
- Pandas helper functions ☆30 Updated 2 years ago
- Spark functions to run popular phonetic and string matching algorithms ☆60 Updated 3 years ago
- Simple machine learning in Python/Tensorflow with model saving ☆14 Updated 7 years ago
- Kubeflow example of machine learning/model serving ☆36 Updated 5 years ago
- High-performance data retrieval from Neo4j with Apache Arrow 🏹 ☆31 Updated 2 years ago
- type-class based data cleansing library for Apache Spark SQL ☆78 Updated 5 years ago
- ☆34 Updated 5 years ago
- A repository for a PySpark Cookbook by Tomasz Drabas and Denny Lee ☆60 Updated 6 years ago
- ☆55 Updated last year
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection ☆18 Updated 8 years ago
- Jupyter notebooks showing how to use Neo4j Graph Algorithms ☆52 Updated 4 years ago
- Asynchronous actions for PySpark ☆47 Updated 3 years ago
- Build your feature store with macros right within your dbt repository ☆38 Updated 2 years ago