dell-research-harvard / linktransformerLinks
A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning
☆119Updated 2 months ago
Alternatives and similar repositories for linktransformer
Users that are interested in linktransformer are comparing it to the libraries listed below
Sorting:
- Entity Matching Model solves the problem of matching company names between two possibly very large datasets.☆72Updated 3 months ago
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets☆114Updated 6 months ago
- Innovation across ages☆70Updated 2 years ago
- Tools for interactive visual exploration of semantic embeddings.☆34Updated 9 months ago
- ☆42Updated this week
- Powerful topic model visualization in Python☆124Updated 2 months ago
- code base for constructing narrative statements from text☆108Updated last year
- Code for measuring novelty in science using publication text☆27Updated 3 months ago
- Code for the paper "CAREER: Transfer Learning for Economic Prediction of Labor Sequence Data"☆40Updated last year
- Course repository for the session "Hands-on Transformers: Fine-Tune your own BERT and GPT" of the Data Science Summer School 2023☆87Updated last year
- Blazing fast topic modelling for short texts.☆32Updated last month
- Python package for text mining of time-series data☆73Updated last month
- An End-to-End Evaluation Framework for Entity Resolution Systems☆28Updated last year
- ☆80Updated 4 years ago
- This repository contains the raw data, code, and sources used to create an individual level and state municipal incorporation date datase…☆24Updated 3 months ago
- Embedding Vector Oriented Clustering☆138Updated last month
- Natural language processing tools developed by the World Bank's DECAT unit. A suite of text preprocessing and cleaning algorithms for NLP…☆10Updated 2 years ago
- Fast, flexible name matching for large datasets☆72Updated 2 weeks ago
- Noise-robust de-duplication at scale☆19Updated 2 years ago
- Google Trends, made easy.☆109Updated 11 months ago
- This offers a Jupyter Notebook introduction on how to use Large Language Models for text analysis within the social sciences.☆65Updated last year
- ☆32Updated last month
- Every big regression is a small regression with weights.☆50Updated 3 weeks ago
- Nesta's Skills Extractor Library☆138Updated this week
- Partition selection, point estimation, pointwise and uniform inference, and graphical procedures using binscatter methods.☆45Updated last week
- Evaluation and benchmarking of PatentsView disambiguation algorithms☆13Updated last year
- ☆55Updated last year
- Replication code for https://www.john-joseph-horton.com/papers/llm_ask.pdf☆35Updated 2 years ago
- A Flexible Deep Learning Approach to Fuzzy String Matching☆145Updated 7 months ago
- A light-weight wrapper for the Datawrapper API.☆63Updated 10 months ago