DistrictDataLabs / entity-resolution
Tutorial code and data for the entity resolution workshops.
☆45 · Updated 10 years ago
Alternatives and similar repositories for entity-resolution
Users who are interested in entity-resolution are comparing it to the libraries listed below.
- Python package aiding in entity disambiguation based on string and location matching ☆18 · Updated last year
- Hidden alignment conditional random field for classifying string pairs. ☆24 · Updated last week
- Multidimensional data explorer and visualization tool. ☆56 · Updated 8 years ago
- Topic models (just LDA for now) on the Hacker News corpus ☆22 · Updated 9 years ago
- Feature Engineering with Pipeline Talk at ODSC West 2016, Santa Clara ☆17 · Updated 8 years ago
- A small utility for converting Stanford GloVe vectors to HDF5 / NumPy ☆12 · Updated 8 years ago
- Scalable String Similarity Joins in Python ☆39 · Updated last year
- Algorithms for "schema matching" ☆26 · Updated 9 years ago
- A Topic Modeling toolbox ☆92 · Updated 9 years ago
- Embed categorical variables via neural networks. ☆59 · Updated 2 years ago
- A Cython implementation of the affine gap string distance ☆57 · Updated 2 years ago
- Multilayer Feed-Forward Neural Network predictive model implementations with TensorFlow and scikit-learn ☆46 · Updated 2 years ago
- Predict age and gender from a first name ☆60 · Updated 6 years ago
- Relatively simple text classification powered by spaCy ☆41 · Updated 9 years ago
- A Python package for Bayesian A/B Testing ☆61 · Updated 2 years ago
- create a browser of a corpus using a topic model; original TMVE implementation (static pages) ☆47 · Updated 10 years ago
- ☆11 · Updated 9 years ago
- Scripts for paper "Encoding high-cardinality string categorical variables" ☆24 · Updated 5 years ago
- NOTE: skutil is now deprecated. See its sister project: https://github.com/tgsmith61591/skoot. Original description: A set of scikit-lear… ☆31 · Updated 7 years ago
- Simplified tree-based classifier and regressor for interpretable machine learning (scikit-learn compatible) ☆46 · Updated 4 years ago
- Slides for my doc2vec workshop/talk ☆29 · Updated 7 years ago
- Tools that make working with scikit-learn and pandas easier. ☆44 · Updated last year
- Python package for Bayesian Tests / AB Testing ☆40 · Updated 4 years ago
- Active Learning for text classification using scikit-learn ☆24 · Updated 6 years ago
- Word2Vec models with Twitter data using Spark. Blog: ☆65 · Updated 6 years ago
- Pandas' group-by/apply with multiprocessing ☆24 · Updated 8 years ago
- 💥 Browser-based slides or PDFs of our talks and presentations ☆94 · Updated 6 years ago
- Common data science and data engineering utilities to help us perform analytics. Our toolbox for data scientists, licensed under Apache-2… ☆30 · Updated 7 years ago
- Contains code for understanding TensorFlow workflow and basics ☆51 · Updated 7 years ago
- ☆46 · Updated 2 months ago