DistrictDataLabs / entity-resolutionLinks
Tutorial code and data for the entity resolution workshops.
☆45Updated 10 years ago
Alternatives and similar repositories for entity-resolution
Users that are interested in entity-resolution are comparing it to the libraries listed below
Sorting:
- Topic models (just LDA for now) on the Hacker News corpus☆22Updated 10 years ago
- 💥 Browser-based slides or PDFs of our talks and presentations☆94Updated 6 years ago
- Algorithms for "schema matching"☆26Updated 9 years ago
- Predict age and gender from a first name☆59Updated 7 years ago
- NLP pipeline using word2vec (preprocessing/embedding/prediction/clustering)☆116Updated last year
- Package that returns a company embedding given a company name☆47Updated 5 years ago
- ☆193Updated last year
- Scalable String Similarity Joins in Python☆39Updated last year
- A Cython implementation of the affine gap string distance☆57Updated 2 years ago
- Multidimensional data explorer and visualization tool.☆56Updated 8 years ago
- ☆21Updated 9 years ago
- Relatively simple text classification powered by spaCy☆41Updated 10 years ago
- A compendium of data projects and associated blog posts☆10Updated 6 years ago
- An in depth tutorial on sklearn's Pipeline and FeatureUnion classes.☆16Updated 8 years ago
- LSH based high dimensional clustering for sets and points☆80Updated 11 years ago
- Data Server for Topic Models☆122Updated 2 years ago
- Materials for the workshop Advanced Text Analysis with SpaCy and Scikit-Learn, given at NYU during NYCDH Week 2017, at PyData NYC in Nov.…☆83Updated 3 years ago
- Python package aiding in entity disambiguation based on string and location matching☆18Updated 2 years ago
- A Topic Modeling toolbox☆92Updated 9 years ago
- Instructions & code for the EuroPython 2014 training session "Topic Modeling for Fun and Profit"☆110Updated 11 years ago
- [development moved to termite-data-server]☆61Updated 11 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- Active Learning for text classification using scikit-learn☆24Updated 6 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy"☆15Updated 9 years ago
- A small utility for converting Stanford GloVe vectors to HDF5 / NumPy☆12Updated 8 years ago
- A collection of simple tutorials for using Fonduer☆100Updated 5 years ago
- Collection of some algorithms for entity resolution☆28Updated 10 years ago
- Record Linkage ToolKit (Find and link entities)☆111Updated 2 years ago
- Fast, flexible name matching for large datasets☆71Updated 3 months ago
- Presentations & notebooks from our talks /workshops/meetups/etc☆24Updated 7 years ago