dell-research-harvard / linktransformerLinks
A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning
☆133Updated last month
Alternatives and similar repositories for linktransformer
Users that are interested in linktransformer are comparing it to the libraries listed below
Sorting:
- Entity Matching Model solves the problem of matching company names between two possibly very large datasets.☆88Updated 2 weeks ago
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets☆118Updated 2 weeks ago
- Nesta's Skills Extractor Library☆148Updated 6 months ago
- An End-to-End Evaluation Framework for Entity Resolution Systems☆35Updated 2 years ago
- Given a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job.☆74Updated last year
- Powerful topic model visualization in Python☆136Updated 8 months ago
- Name matching is a Python package for the matching of company names. This package has been developed to match the names of companies from…☆161Updated last week
- Python package for text mining of time-series data☆76Updated 7 months ago
- Innovation across ages☆72Updated 2 years ago
- ☆53Updated this week
- ☆80Updated last week
- Embedding Vector Oriented Clustering☆161Updated last week
- List of entity resolution software and resources.☆102Updated 9 months ago
- Tools for interactive visual exploration of semantic embeddings.☆39Updated last year
- Fast, flexible name matching for large datasets☆71Updated 3 months ago
- Prototype search engine for ONS bulletins☆24Updated last month
- The official Github for the American Stories dataset as in {link}☆127Updated last year
- PatentSBERTa: A Deep NLP based Hybrid Model for Patent Distance and Classification using Augmented SBERT☆100Updated last year
- Interactive notebooks containing demonstration code of the splink library☆40Updated last year
- Robust and fast topic models with sentence-transformers.☆83Updated last week
- My personal frontpage app☆108Updated last week
- ☆103Updated last year
- Zero/few shot learning components for scikit-learn pipelines with LLMs and transformers.☆18Updated last year
- Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers…☆374Updated this week
- code base for constructing narrative statements from text☆116Updated 2 years ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆191Updated 6 months ago
- VIINA: Violent Incident Information from News Articles on the 2022 Russian Invasion of Ukraine☆327Updated this week
- A Flexible Deep Learning Approach to Fuzzy String Matching☆147Updated last year
- Google Trends, made easy.☆115Updated last year
- This repository contains the raw data, code, and sources used to create an individual level and state municipal incorporation date datase…☆26Updated 9 months ago