DistrictDataLabs / entity-resolution
Tutorial code and data for the entity resolution workshops.
☆44Updated 9 years ago
Alternatives and similar repositories for entity-resolution:
Users that are interested in entity-resolution are comparing it to the libraries listed below
- Collection of some algorithms for entity resolution☆28Updated 9 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy"☆15Updated 8 years ago
- Feature Engineering with Pipeline Talk at ODSC West 2016, Santa Clara☆17Updated 8 years ago
- A Topic Modeling toolbox☆92Updated 8 years ago
- Python package aiding in entity disambiguation based on string and location matching☆18Updated last year
- Multidimensional data explorer and visualization tool.☆55Updated 7 years ago
- Web Service for E-Discovery Analytics☆75Updated 2 years ago
- Using Word2Vec on lists and sets☆34Updated 9 years ago
- ☆21Updated 8 years ago
- A small utility for converting Stanford GloVe vectors to HDF5 / NumPy☆12Updated 7 years ago
- Hidden alignment conditional random field for classifying string pairs.☆24Updated 4 months ago
- Library for Geo-Inferencing in Twitter Data☆28Updated 8 years ago
- A Cython implementation of the affine gap string distance☆57Updated 2 years ago
- My machine learning model for the See Click Predict Fix Kaggle competition☆31Updated 7 years ago
- Data Server for Topic Models☆121Updated last year
- Algorithms for "schema matching"☆25Updated 8 years ago
- Natural Language Processing with Spark's MLlib☆62Updated 7 years ago
- NLP pipeline using word2vec (preprocessing/embedding/prediction/clustering)☆115Updated 8 months ago
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- Reinforcement Learning Algorithms☆14Updated 6 years ago
- A guide on extracting entities from raw text in order to conduct social network analysis.☆20Updated 7 years ago
- Tool for tweaking dbpedia spotlight's models☆16Updated 7 years ago
- Instructions & code for the EuroPython 2014 training session "Topic Modeling for Fun and Profit"☆110Updated 10 years ago
- [development moved to termite-data-server]☆61Updated 10 years ago
- ☆27Updated 6 years ago
- pipeline library☆12Updated 6 years ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆25Updated 4 years ago
- Learning String Alignments for Entity Aliases☆37Updated 5 years ago
- Regex like pattern tree matching but on sentence's tree instead of Strings☆42Updated 6 years ago
- NLP text recommendation system built in Python using Gensim, spaCy, and Plotly Dash☆15Updated 6 years ago