mitdbg / lazoLinks
Sketch and LSH Index library for Java, including OPH methods as well as the Lazo method
☆15Updated 2 years ago
Alternatives and similar repositories for lazo
Users that are interested in lazo are comparing it to the libraries listed below
Sorting:
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies☆21Updated 3 years ago
- Project overview and links to various resources☆20Updated 4 years ago
- Graph Engine for Exploration and Search☆42Updated 2 years ago
- Scalable String Similarity Joins in Python☆39Updated last year
- FlexMatcher is a schema matching package in Python which handles the problem of matching multiple schemas to a single mediated schema.☆29Updated last year
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆142Updated last year
- Algorithms for "schema matching"☆26Updated 9 years ago
- Record Linkage ToolKit (Find and link entities)☆111Updated 2 years ago
- Mirror from: https://gitlab.com/ViDA-NYU/auctus/auctus☆44Updated 8 months ago
- A Cython implementation of the affine gap string distance☆57Updated 3 years ago
- ☆70Updated 3 years ago
- Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup☆70Updated 8 months ago
- ☆11Updated 2 years ago
- A maximum-strength name parser for record linkage.☆39Updated 5 months ago
- A Jupyter notebook extension to centralize and manage data☆15Updated 3 years ago
- quadipy is a python package to help transform structured data into RDF graph format☆19Updated 2 years ago
- ☆17Updated 10 years ago
- utils to use word embedding models like word2vec vectors in a PostgreSQL database☆144Updated 4 years ago
- Distributed Bayesian Entity Resolution in Apache Spark☆59Updated 4 years ago
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o…☆47Updated 7 years ago
- A tool to read CSV files with CSVW metadata and transform them into other formats.☆33Updated 6 years ago
- High-performance data retrieval from Neo4j with Apache Arrow 🏹☆32Updated 3 years ago
- ☆193Updated last year
- It has never been easier to transform your RDF data into a property graph based on TinkerPop-Gremlin.☆25Updated 5 years ago
- Extraction Toolkit☆83Updated 4 years ago
- A Python wrapper over the GraphGen system☆37Updated 8 years ago
- An index data structure for approximate string search.☆23Updated 6 years ago
- Trying to generate name synonyms from wikidata☆35Updated 5 years ago
- Ensemble topic modelling with pLSA☆114Updated 4 years ago
- Python package for deduplication/entity resolution using active learning☆83Updated last year