Senzing / awesomeLinks
Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.
☆62Updated this week
Alternatives and similar repositories for awesome
Users that are interested in awesome are comparing it to the libraries listed below
Sorting:
- PyPi module for Graphlet AI Knowledge Graph Factory☆29Updated 2 years ago
- Entity resolution, also known as Data Matching or Record linkage is the task of finding a data set that refer to the same or similar real…☆29Updated 5 months ago
- Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️☆18Updated this week
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆36Updated 3 years ago
- A maximum-strength name parser for record linkage.☆38Updated last week
- Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets…☆46Updated 4 years ago
- Various Jupyter notebooks about Common Crawl data☆57Updated 5 months ago
- Python based Wikidata framework for easy dataframe extraction☆45Updated last year
- Tools to construct and process Common Crawl webgraphs☆96Updated 2 weeks ago
- Examples scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask and synthesize PII in text.☆83Updated 4 months ago
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆24Updated 3 years ago
- Record Linkage ToolKit (Find and link entities)☆110Updated 2 years ago
- Scalable String Similarity Joins in Python☆39Updated last year
- MLOps simplified. One-stop AI delivery platform, all the features you need.☆102Updated last week
- High-performance data retrieval from Neo4j with Apache Arrow 🏹☆31Updated 3 years ago
- Resources for tackling record linkage / deduplication / data matching problems☆125Updated last year
- ☆70Updated 2 years ago
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)☆141Updated 2 years ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆92Updated 3 years ago
- Trying to generate name synonyms from wikidata☆33Updated 5 years ago
- ☆51Updated this week
- MkDocs plugin to generate semantic reference Markdown pages from a knowledge graph☆39Updated last year
- Socrates is a thin wrapper around an early-stage [AllenNLP](https://allennlp.org/) model that enables machine reading comprehension (MRC)…☆14Updated 4 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a…☆99Updated 2 years ago
- TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graph☆25Updated last year
- API for OpenSanctions with support for entity search and bulk matching of data collections. Supports Reconciliation API spec.☆97Updated this week
- GraphiPy: Universal Social Data Extractor☆83Updated 2 years ago
- Aim-spaCy integration☆34Updated 2 years ago
- Magniv Core - A Python-decorator based job orchestration platform. Avoid responsibility handoffs by abstracting infra and DevOps.☆80Updated last year
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients☆38Updated last year