DistrictDataLabs / entity-resolution
Tutorial code and data for the entity resolution workshops.
☆45Updated 9 years ago
Alternatives and similar repositories for entity-resolution:
Users who are interested in entity-resolution compare it to the libraries listed below.
- Collection of some algorithms for entity resolution☆28Updated 9 years ago
- Algorithms for "schema matching"☆26Updated 8 years ago
- Topic models (just LDA for now) on the Hacker News corpus☆22Updated 9 years ago
- Code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy"☆15Updated 8 years ago
- A small utility for converting Stanford GloVe vectors to HDF5 / NumPy☆12Updated 7 years ago
- A Topic Modeling toolbox☆92Updated 8 years ago
- My machine learning model for the See Click Predict Fix Kaggle competition☆31Updated 7 years ago
- SparkER: an Entity Resolution framework for Apache Spark☆64Updated last year
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o…☆46Updated 6 years ago
- Code and data for the AAAI 2015 paper entitled: "Predicting the demographics of Twitter users from social evidence using website traffic …☆44Updated 5 years ago
- Predict age and gender from a first name☆60Updated 6 years ago
- Reinforcement Learning Algorithms☆14Updated 6 years ago
- Deep entity resolution, lite version☆11Updated 5 years ago
- Word2Vec models with Twitter data using Spark.☆65Updated 6 years ago
- Python package aiding in entity disambiguation based on string and location matching☆18Updated last year
- Embed categorical variables via neural networks.☆59Updated 2 years ago
- Python package for Bayesian Tests / AB Testing☆40Updated 4 years ago
- Package that returns a company embedding given a company name☆45Updated 4 years ago
- A guide on extracting entities from raw text in order to conduct social network analysis.☆20Updated 7 years ago
- Create a browser for a corpus using a topic model; original TMVE implementation (static pages)☆47Updated 9 years ago
- Feature Engineering with Pipeline Talk at ODSC West 2016, Santa Clara☆17Updated 8 years ago
- A Cython implementation of the affine gap string distance☆57Updated 2 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- Python package for deduplication using flexible text matching and cleaning in pandas DataFrames☆25Updated 4 years ago
- Ensemble topic modeling with matrix factorization☆25Updated 6 years ago
- Repository for the paper "Ethnicity sensitive author disambiguation using semi-supervised learning"☆22Updated 8 years ago
- Hidden alignment conditional random field for classifying string pairs.☆24Updated 6 months ago
- Semantic natural language understanding at scale using Spark, machine-learned annotators and deep-learned ontologies☆20Updated 8 years ago