chu-data-lab / AutomaticFuzzyJoinLinks
☆26Updated 4 years ago
Alternatives and similar repositories for AutomaticFuzzyJoin
Users that are interested in AutomaticFuzzyJoin are comparing it to the libraries listed below
Sorting:
- A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching …☆92Updated 4 months ago
- DynaMo: Dynamic Community Detection by Incrementally Maximizing Modularity☆29Updated 4 years ago
- The SEMB library is an easy-to-use tool for getting and evaluating structural node embeddings in graphs.☆18Updated 2 years ago
- A tiny library for larger graphs☆119Updated last year
- Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond…☆23Updated 3 years ago
- ☆26Updated 7 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …☆33Updated 2 years ago
- Code for the CIKM 2019 Paper "Fast and Accurate Network Embeddings via Very Sparse Random Projection"☆58Updated 5 years ago
- Entity resolution using zero labeled examples☆30Updated last year
- Code for extracting, parsing and annotating tables from GitTables (https://gittables.github.io).☆45Updated 3 years ago
- Code and data for Sato https://arxiv.org/abs/1911.06311.☆115Updated last year
- A fast, parallelized, memory efficient, and cache-optimized Python implementation of node2vec☆169Updated 2 weeks ago
- Graph Embedding via Frequent Subgraphs☆45Updated 5 years ago
- ☆32Updated 4 years ago
- Hardware-agnostic Framework for Large-scale Knowledge Graph Embeddings☆52Updated last week
- Welcome to Snowman App – a Data Matching Benchmark Platform.☆38Updated 2 years ago
- Compositional and Parameter-Efficient Representations for Large Knowledge Graphs (ICLR'22)☆142Updated 3 years ago
- Project overview and links to various resources☆19Updated 3 years ago
- Code for the paper "Deep Entity Matching with Pre-trained Language Models"☆293Updated last year
- LEMON: Explainable Entity Matching☆18Updated 3 years ago
- TransformerDB☆19Updated 4 years ago
- ☆193Updated last year
- Learn2Clean: Optimizing the Sequence of Tasks for Data Preparation and Cleaning☆51Updated 2 years ago
- Data-Centric What-If Analysis for Native Machine Learning Pipelines☆16Updated 2 years ago
- Clustering for arbitrary data and dissimilarity function☆97Updated last year
- An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.☆82Updated 2 weeks ago
- Node Embeddings in Dynamic Graphs☆57Updated 3 years ago
- Automatic feature extraction and node role assignment for transfer learning on graphs (ReFeX & RolX)☆88Updated last year
- Benchmark for Graph Embedding Methods☆47Updated 4 years ago
- Jenga is an experimentation library that allows data science practititioners and researchers to study the effect of common data corruptio…☆41Updated 2 years ago