dell-research-harvard / linktransformerLinks
A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning
☆125Updated 3 months ago
Alternatives and similar repositories for linktransformer
Users that are interested in linktransformer are comparing it to the libraries listed below
Sorting:
- Entity Matching Model solves the problem of matching company names between two possibly very large datasets.☆73Updated 4 months ago
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets☆117Updated 7 months ago
- An End-to-End Evaluation Framework for Entity Resolution Systems☆30Updated last year
- Nesta's Skills Extractor Library☆140Updated last month
- Robust and fast topic models with sentence-transformers.☆69Updated 2 weeks ago
- Powerful topic model visualization in Python☆126Updated 3 months ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆179Updated last month
- Code for the paper "CAREER: Transfer Learning for Economic Prediction of Labor Sequence Data"☆43Updated last year
- Code for measuring novelty in science using publication text☆30Updated 4 months ago
- Google Trends, made easy.☆111Updated last year
- Python package for text mining of time-series data☆73Updated 2 months ago
- A python package to enrich Twitter Data☆75Updated 2 years ago
- Innovation across ages☆70Updated 2 years ago
- Course repository for the session "Hands-on Transformers: Fine-Tune your own BERT and GPT" of the Data Science Summer School 2023☆87Updated last year
- This repository contains the raw data, code, and sources used to create an individual level and state municipal incorporation date datase…☆24Updated 4 months ago
- Given a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job.☆75Updated last year
- ☆47Updated this week
- Zero/few shot learning components for scikit-learn pipelines with LLMs and transformers.☆18Updated 7 months ago
- Prototype search engine for ONS bulletins☆24Updated last year
- Embedding Vector Oriented Clustering☆144Updated 3 months ago
- Fast, flexible name matching for large datasets☆72Updated last month
- A Fast, Adaptive, Stable, and Transferable Topic Model (NeurIPS 2024)☆111Updated this week
- VIINA: Violent Incident Information from News Articles on the 2022 Russian Invasion of Ukraine☆306Updated this week
- ☆55Updated last year
- Easy PDF to text to spaCy text extraction in Python.☆39Updated 9 months ago
- Tools for interactive visual exploration of semantic embeddings.☆35Updated 10 months ago
- A BERT-based application for reusable text classification at scale☆38Updated last year
- List of entity resolution software and resources.☆77Updated 4 months ago
- ☆22Updated 4 years ago
- Name matching is a Python package for the matching of company names. This package has been developed to match the names of companies from…☆155Updated 2 months ago