davidfoerster / schema-matching
Match schema attributes of relational databases by value similarity. This was a study assignment, so it isn't well documented, but you can contact me with questions, and I may add docs if I sense enough interest.
☆24Updated 5 years ago
Alternatives and similar repositories for schema-matching:
Users who are interested in schema-matching are comparing it to the libraries listed below:
- FlexMatcher is a schema matching package in Python which handles the problem of matching multiple schemas to a single mediated schema.☆29Updated 4 months ago
- Algorithms for "schema matching"☆26Updated 8 years ago
- Tutorial code and data for the entity resolution workshops.☆45Updated 9 years ago
- A Cython implementation of the affine gap string distance☆57Updated 2 years ago
- Use ML-Annotate to label data for machine learning purposes☆109Updated 4 years ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 3 years ago
- ☄️ Parallel and distributed training with spaCy and Ray☆53Updated last year
- An End-to-End Evaluation Framework for Entity Resolution Systems☆27Updated last year
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking☆85Updated 2 years ago
- PySpark phonetic and string matching algorithms☆39Updated last year
- Inter-annotator agreement for Doccano☆27Updated 4 years ago
- Instant search for and access to many datasets in PySpark.☆34Updated 2 years ago
- Record matching and entity resolution at scale in Spark☆34Updated last year
- Hidden alignment conditional random field for classifying string pairs.☆24Updated 6 months ago
- Demo of a supervised machine learning approach for Entity Resolution in graph using Neo4j GDS Link Prediction Pipelines☆22Updated 3 years ago
- Python package for performing deduplication using flexible text matching and cleaning in pandas DataFrames☆25Updated 4 years ago
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆137Updated 8 months ago
- Using word embeddings (word2vec) for ontology learning☆19Updated 8 years ago
- An ontology matcher for matching entities between knowledgebases☆67Updated 2 years ago
- A collection of simple tutorials for using Fonduer☆99Updated 4 years ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stan…☆53Updated 2 years ago
- High-performance data retrieval from Neo4j with Apache Arrow 🏹☆31Updated 2 years ago
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library☆51Updated 2 years ago
- Collection of some algorithms for entity resolution☆28Updated 9 years ago
- Scalable String Similarity Joins in Python☆39Updated 9 months ago
- SparkER: an Entity Resolution framework for Apache Spark☆64Updated last year
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same…☆28Updated 2 years ago
- This repository contains machine learning related work for the corpus to graph project, including Jupyter research notebooks and a Flask …☆46Updated 8 years ago
- Functional and structural analysis of tables in research papers (Table disentangling)☆20Updated 7 years ago