DistrictDataLabs / entity-resolution
Tutorial code and data for the entity resolution workshops.
☆45 Updated 9 years ago
Alternatives and similar repositories for entity-resolution
Users who are interested in entity-resolution compare it to the libraries listed below.
- Algorithms for "schema matching" ☆26 Updated 8 years ago
- Collection of some algorithms for entity resolution ☆28 Updated 9 years ago
- Python package aiding in entity disambiguation based on string and location matching ☆18 Updated last year
- Slides and Code Tutorials for Strata Data 2018 Tutorial on Deep Learning Methodologies for Natural Language Processing ☆22 Updated 6 years ago
- ☆54 Updated 7 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy" ☆15 Updated 8 years ago
- A Topic Modeling toolbox ☆92 Updated 9 years ago
- Active Learning for text classification using scikit-learn ☆24 Updated 5 years ago
- A simple introduction to using spark ml pipelines ☆26 Updated 7 years ago
- Natural Language Processing with Spark's MLlib ☆62 Updated 7 years ago
- ☆21 Updated 9 years ago
- create a browser of a corpus using a topic model; original TMVE implementation (static pages) ☆47 Updated 9 years ago
- Quora Kaggle Competition : Natural Language Processing using word2vec embeddings, scikit-learn and xgboost for training ☆18 Updated 6 years ago
- Hidden alignment conditional random field for classifying string pairs. ☆24 Updated 3 weeks ago
- Multidimensional data explorer and visualization tool. ☆56 Updated 8 years ago
- A small utility for converting Stanford GloVe vectors to HDF5 / NumPy ☆12 Updated 8 years ago
- Word2Vec models with Twitter data using Spark. Blog: ☆65 Updated 6 years ago
- Library for Geo-Inferencing in Twitter Data ☆28 Updated 8 years ago
- Semantic natural language understanding at scale using Spark, machine-learned annotators and deep-learned ontologies ☆20 Updated 8 years ago
- Python script for matching a list of messy addresses against a gazetteer using dedupe. ☆63 Updated 5 years ago
- NOTE: skutil is now deprecated. See its sister project: https://github.com/tgsmith61591/skoot. Original description: A set of scikit-lear… ☆31 Updated 7 years ago
- Feature Engineering with Pipeline Talk at ODSC West 2016, Santa Clara ☆17 Updated 8 years ago
- Event extraction pipeline. ☆34 Updated 7 years ago
- Materials for the workshop Advanced Text Analysis with SpaCy and Scikit-Learn, given at NYU during NYCDH Week 2017, at PyData NYC in Nov.… ☆82 Updated 2 years ago
- 💥 Browser-based slides or PDFs of our talks and presentations ☆94 Updated 6 years ago
- A simple command line interface to the datamade/dedupe library. ☆42 Updated 2 years ago
- Dummy variable generation with fit/transform capabilities ☆23 Updated 6 years ago
- Probabilistic/machine-learning algorithms for medical record linkage [Critical Juncture] ☆14 Updated 7 years ago
- Stability analysis for topic models ☆51 Updated 8 years ago
- Sandbox for playing with Neo4J and graph approaches to NLP ☆12 Updated 7 years ago