olehmberg / WebTableStitching
☆11 Updated 8 years ago
Alternatives and similar repositories for WebTableStitching
Users who are interested in WebTableStitching are comparing it to the libraries listed below.
- T2K Match is a matching algorithm optimised to match millions of web tables to a central knowledge base. ☆21 Updated 7 years ago
- ☆79 Updated 2 years ago
- WInte.r is a Java framework for end-to-end data integration. The WInte.r framework implements well-known methods for data pre-processing,… ☆110 Updated 3 years ago
- A comprehensive and scalable set of string tokenizers and similarity measures in Python ☆140 Updated last year
- Collection of some algorithms for entity resolution ☆28 Updated 9 years ago
- ☆192 Updated last year
- Hidden alignment conditional random field for classifying string pairs. ☆24 Updated last week
- A Machine Learning System for Data Enrichment. ☆75 Updated 6 years ago
- Record Linkage ToolKit (Find and link entities) ☆110 Updated last year
- Welcome to Snowman App – a Data Matching Benchmark Platform. ☆38 Updated 2 years ago
- SparkER: an Entity Resolution framework for Apache Spark ☆65 Updated last year
- An open source, high scalability toolkit in Java for Entity Resolution. ☆218 Updated 3 weeks ago
- Python library for information extraction of quantities from unstructured text ☆119 Updated 2 years ago
- A Cython implementation of the affine gap string distance ☆57 Updated 2 years ago
- TAXI: a Taxonomy Induction Method based on Lexico-Syntactic Patterns, Substrings and Focused Crawling ☆28 Updated 2 years ago
- Knowledge extraction from web data ☆92 Updated 7 years ago
- Machine Learning Procedures and Functions for Neo4j ☆64 Updated 6 years ago
- Python package aiding in entity disambiguation based on string and location matching ☆18 Updated last year
- Project overview and links to various resources ☆19 Updated 3 years ago
- Fork of the Freely Extensible Biomedical Record Linkage program ☆24 Updated 8 years ago
- Repository for the paper "Ethnicity sensitive author disambiguation using semi-supervised learning" ☆22 Updated 8 years ago
- Resources for tackling record linkage / deduplication / data matching problems ☆125 Updated last year
- Applications and APIs from Oracle Graph ☆51 Updated last month
- Numba-based version of DimmWitted Gibbs sampler ☆46 Updated 7 years ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe ☆25 Updated 4 years ago
- Scalable String Similarity Joins in Python ☆39 Updated last year
- Algorithms for "schema matching" ☆26 Updated 9 years ago
- Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents ☆290 Updated 2 years ago
- A Python wrapper over the GraphGen system ☆37 Updated 7 years ago
- Tutorial code and data for the entity resolution workshops. ☆45 Updated 10 years ago