dell-research-harvard / linktransformer
A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning
☆118Updated 2 weeks ago
Alternatives and similar repositories for linktransformer:
Users that are interested in linktransformer are comparing it to the libraries listed below
- Entity Matching Model solves the problem of matching company names between two possibly very large datasets.☆71Updated last month
- Innovation across ages☆69Updated 2 years ago
- Powerful topic model visualization in Python☆119Updated last month
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets☆112Updated 4 months ago
- Code for measuring novelty in science using publication text☆26Updated last month
- Nesta's Skills Extractor Library☆129Updated 5 months ago
- code base for constructing narrative statements from text☆107Updated last year
- Given a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job.☆73Updated 9 months ago
- ☆54Updated last year
- Google Trends, made easy.☆105Updated 10 months ago
- An End-to-End Evaluation Framework for Entity Resolution Systems☆27Updated last year
- ☆39Updated last week
- A python package to enrich Twitter Data☆75Updated last year
- Noise-robust de-duplication at scale☆18Updated 2 years ago
- Robust and fast topic models with sentence-transformers.☆48Updated this week
- An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.☆76Updated this week
- List of entity resolution software and resources.☆63Updated 2 months ago
- A Flexible Deep Learning Approach to Fuzzy String Matching☆145Updated 6 months ago
- This repository contains the raw data, code, and sources used to create an individual level and state municipal incorporation date datase…☆23Updated last month
- ☆31Updated 2 weeks ago
- Interactive notebooks containing demonstration code of the splink library☆38Updated last year
- Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers…☆235Updated this week
- HDBSCAN Tuning for BERTopic Models☆45Updated last year
- Tools for interactive visual exploration of semantic embeddings.☆32Updated 7 months ago
- Evaluation and benchmarking of PatentsView disambiguation algorithms☆13Updated last year
- A shared repository for data cleaning scripts used for innovation data.☆30Updated 3 years ago
- Blazing fast topic modelling for short texts.☆31Updated 2 weeks ago
- ☆82Updated 10 months ago
- Code for the paper 'Conversations at Scale: Robust AI-led Interviews with a Simple Open-Source Platform'☆32Updated 2 months ago
- A BERT-based application for reusable text classification at scale☆38Updated last year