davidfoerster / schema-matching
Match schema attributes of relational databases by value similarity. As a study assignment, this isn't well documented, but you can contact me for questions and I may even add docs, if I sense enough interest.
☆23Updated 4 years ago
Related projects: ⓘ
- FlexMatcher is a schema matching package in Python which handles the problem of matching multiple schemas to a single mediated schema.☆31Updated this week
- Algorithms for "schema matching"☆25Updated 8 years ago
- Tutorial code and data for the entity resolution workshops.☆45Updated 9 years ago
- Demo of a supervised machine learning approach for Entity Resolution in graph using Neo4j GDS Link Prediction Pipelines☆21Updated 2 years ago
- Hidden alignment conditional random field for classifying string pairs.☆25Updated this week
- ☆16Updated 3 years ago
- A Cython implementation of the affine gap string distance☆58Updated last year
- Collection of some algorithms for entity resolution☆28Updated 9 years ago
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same…☆28Updated last year
- High-performance data retrieval from Neo4j with Apache Arrow 🏹☆31Updated 2 years ago
- SparkER: an Entity Resolution framework for Apache Spark☆63Updated 5 months ago
- In this project, we need to find out commercial products listed on Google that refer to the same entity across Amazon by comparing the si…☆12Updated 7 years ago
- Jupyter notebooks showing how to use Neo4j Graph Algorithms☆52Updated 4 years ago
- ☆16Updated 9 years ago
- ☆56Updated this week
- Record matching and entity resolution at scale in Spark☆31Updated 10 months ago
- NetworkX API for Neo4j Graph Algorithms.☆124Updated 4 years ago
- ☆19Updated 6 years ago
- Tutorial for Topic Modelling using PySpark and Spark NLP☆16Updated 4 years ago
- A python wrapper to call Neo4j Graph Data Science procedures from python using the Neo4j python driver☆30Updated 2 years ago
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear…☆84Updated 3 years ago
- TAXI: a Taxonomy Induction Method based on Lexico-Syntactic Patterns, Substrings and Focused Crawling☆29Updated last year
- Exploring News Recommendation With Neo4j GDS☆30Updated last year
- openclean - Data Cleaning and data profiling library for Python☆66Updated 2 years ago
- Graph databases, Knowledge Graphs, SPARQ☆74Updated 3 years ago
- Investigating into how to extract meaningful topic names from textual data☆20Updated 4 years ago
- Machine Learning Procedures and Functions for Neo4j☆64Updated 5 years ago
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies☆15Updated last year
- Slides and Code Tutorials for Strata Data 2018 Tutorial on Deep Learning Methodologies for Natural Language Processing☆22Updated 6 years ago
- ☆27Updated last year