biggorilla-gh / flexmatcher
FlexMatcher is a schema matching package in Python which handles the problem of matching multiple schemas to a single mediated schema.
☆29Updated 2 months ago
Alternatives and similar repositories for flexmatcher:
Users that are interested in flexmatcher are comparing it to the libraries listed below
- Match schema attributes of relational databases by value similarity. As a study assignment, this isn't well documented, but you can conta…☆24Updated 5 years ago
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies☆19Updated 2 years ago
- Graph Engine for Exploration and Search☆40Updated last year
- Algorithms for "schema matching"☆26Updated 8 years ago
- A systematic Benchmarking on the performance of Spark-SQL for processing Vast RDF datasets☆14Updated 2 years ago
- Sketch and LSH Index library for Java, including OPH methods as well as the Lazo method☆13Updated last year
- It has never been easier to transform your RDF data into a property graph based on TinkerPop-Gremlin.☆24Updated 4 years ago
- SparkER: an Entity Resolution framework for Apache Spark☆63Updated 10 months ago
- A Jupyter notebook extension to centralize and manage data☆14Updated 2 years ago
- 📚 CORE ontology of ML-Schema and mapping to other machine learning vocabularies and ontologies (DMOP, Exposé, OntoDM, and MEX)☆27Updated 4 years ago
- Hidden alignment conditional random field for classifying string pairs.☆24Updated 4 months ago
- High-performance data retrieval from Neo4j with Apache Arrow 🏹☆31Updated 2 years ago
- ☆11Updated last year
- A Flask decorator to output RDF using content negotiation.☆16Updated 4 years ago
- Inspect ML Pipelines in Python in the form of a DAG☆70Updated 11 months ago
- Record matching and entity resolution at scale in Spark☆34Updated last year
- Scalable String Similarity Joins in Python☆38Updated 7 months ago
- A fast and lightweight Python RDF parser which wraps bindings to Rust's Rio using PyO3☆29Updated last year
- Abstractions for feature engineering on large graphs of tabular data.☆21Updated 3 weeks ago
- A python tool using XGboost and sentence-transformers to perform schema matching task on tables.☆31Updated this week
- A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching …☆86Updated 2 weeks ago
- A Cython implementation of the affine gap string distance☆57Updated 2 years ago
- Linked Data to Natural Language☆11Updated last year
- RDFLib store using SQLAlchemy dbapi as back-end☆151Updated last year
- openclean - Data Cleaning and data profiling library for Python☆72Updated 3 years ago
- Knowledge Graph Extension for Python - Team Project 2020 @ Uni Mannheim☆76Updated 3 years ago
- An End-to-End Evaluation Framework for Entity Resolution Systems☆26Updated last year
- RDFLib and SQLAlchemy bindings for Virtuoso☆16Updated 7 years ago
- Project overview and links to various resources☆18Updated 3 years ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to…☆26Updated 2 months ago