DistrictDataLabs / entity-resolution
Tutorial code and data for the entity resolution workshops.
☆45 Updated 10 years ago
Alternatives and similar repositories for entity-resolution
Users who are interested in entity-resolution are comparing it to the libraries listed below.
- Topic models (just LDA for now) on the Hacker News corpus ☆22 Updated 10 years ago
- A Topic Modeling toolbox ☆92 Updated 9 years ago
- Instructions & code for the EuroPython 2014 training session "Topic Modeling for Fun and Profit" ☆111 Updated 11 years ago
- content discovery... IN 3D ☆49 Updated 8 years ago
- A simple example of containerized data science with python and Docker. ☆51 Updated 7 years ago
- Record Linkage ToolKit (Find and link entities) ☆111 Updated 2 years ago
- Predict age and gender from a first name ☆59 Updated 7 years ago
- Multidimensional data explorer and visualization tool. ☆56 Updated 8 years ago
- create a browser of a corpus using a topic model; original TMVE implementation (static pages) ☆47 Updated 10 years ago
- NLP pipeline using word2vec (preprocessing/embedding/prediction/clustering) ☆116 Updated last year
- Word2Vec models with Twitter data using Spark. Blog: ☆66 Updated 7 years ago
- NOTE: skutil is now deprecated. See its sister project: https://github.com/tgsmith61591/skoot. Original description: A set of scikit-lear… ☆31 Updated 7 years ago
- ☆21 Updated 9 years ago
- Materials for the workshop Advanced Text Analysis with SpaCy and Scikit-Learn, given at NYU during NYCDH Week 2017, at PyData NYC in Nov.… ☆83 Updated 3 years ago
- Relatively simple text classification powered by spaCy ☆41 Updated 10 years ago
- ☆193 Updated last year
- Python package aiding in entity disambiguation based on string and location matching ☆18 Updated 2 years ago
- 💥 Browser-based slides or PDFs of our talks and presentations ☆94 Updated 7 years ago
- An in depth tutorial on sklearn's Pipeline and FeatureUnion classes. ☆16 Updated 8 years ago
- Slides for my doc2vec workshop/talk ☆29 Updated 8 years ago
- Topic modeling with gensim and LDA ☆168 Updated 8 years ago
- Embed categorical variables via neural networks. ☆59 Updated 2 years ago
- Slides and Code Tutorials for Strata Data 2018 Tutorial on Deep Learning Methodologies for Natural Language Processing ☆22 Updated 7 years ago
- A Cython implementation of the affine gap string distance ☆57 Updated 3 years ago
- My machine learning model for the See Click Predict Fix Kaggle competition ☆31 Updated 8 years ago
- Common data science and data engineering utilities to help us perform analytics. Our toolbox for data scientists, licensed under Apache-2… ☆30 Updated 7 years ago
- Predicting happiness from demographics and poll answers ☆46 Updated 9 years ago
- Twitter visualization experiment using various python-based technologies. ☆60 Updated 9 years ago
- Tools, wrappers, etc... for data science with a concentration on text processing ☆207 Updated 3 years ago
- A curated list of resources dedicated to text summarization ☆54 Updated 7 years ago