dell-research-harvard / linktransformer
A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning
☆99Updated 3 months ago
Related projects: ⓘ
- Innovation across ages☆64Updated last year
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets☆107Updated 4 months ago
- ☆30Updated 2 months ago
- Entity Matching Model solves the problem of matching company names between two possibly very large datasets.☆43Updated this week
- Google Trends, made easy.☆101Updated 3 months ago
- Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers…☆169Updated this week
- Fast, flexible name matching for large datasets☆69Updated 9 months ago
- ☆131Updated last month
- Nesta's Skills Extractor Library☆118Updated last month
- A shared repository for data cleaning scripts used for innovation data.☆27Updated 3 years ago
- ☆70Updated 3 months ago
- Given a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job.☆73Updated 2 months ago
- LLM4Data is a Python library designed to facilitate the application of large language models (LLMs) and artificial intelligence for devel…☆46Updated 6 months ago
- Unstructured Code with interesting analysis☆33Updated 2 months ago
- code base for constructing narrative statements from text☆93Updated last year
- Powerful topic model visualization in Python☆96Updated 3 weeks ago
- PatentSBERTa: A Deep NLP based Hybrid Model for Patent Distance and Classification using Augmented SBERT☆62Updated last year
- This offers a Jupyter Notebook introduction on how to use Large Language Models for text analysis within the social sciences.☆55Updated 5 months ago
- Replication code for https://www.john-joseph-horton.com/papers/llm_ask.pdf☆31Updated last year
- An End-to-End Evaluation Framework for Entity Resolution Systems☆24Updated 9 months ago
- Course repository for the session "Hands-on Transformers: Fine-Tune your own BERT and GPT" of the Data Science Summer School 2023☆84Updated last year
- Code for the paper "CAREER: Transfer Learning for Economic Prediction of Labor Sequence Data"☆30Updated 3 months ago
- The Harvard USPTO Patent Dataset☆54Updated 9 months ago
- An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.☆69Updated 3 weeks ago
- Python package for text mining of time-series data☆66Updated 2 weeks ago
- ✨ Awesome - A curated list of amazing Topic Models (implementations, libraries, and resources)☆87Updated 2 years ago
- Partition selection, point estimation, pointwise and uniform inference, and graphical procedures using binscatter methods.☆38Updated last month
- ☆26Updated 4 months ago
- A machine learning library for economics and finance☆15Updated this week
- A python package to enrich Twitter Data☆73Updated last year