DistrictDataLabs / entity-resolution
Tutorial code and data for the entity resolution workshops.
☆45 Updated 10 years ago
Alternatives and similar repositories for entity-resolution
Users who are interested in entity-resolution compare it to the libraries listed below.
- Topic models (just LDA for now) on the Hacker News corpus ☆22 Updated 10 years ago
- An automated ingestion service for blogs to construct a corpus for NLP research. ☆86 Updated 7 years ago
- 💥 Browser-based slides or PDFs of our talks and presentations ☆94 Updated 6 years ago
- create a browser of a corpus using a topic model; original TMVE implementation (static pages) ☆47 Updated 10 years ago
- A Cython implementation of the affine gap string distance ☆57 Updated 2 years ago
- Relatively simple text classification powered by spaCy ☆41 Updated 9 years ago
- A Topic Modeling toolbox ☆92 Updated 9 years ago
- Algorithms for "schema matching" ☆26 Updated 9 years ago
- Predict age and gender from a first name ☆60 Updated 6 years ago
- Active Learning for text classification using scikit-learn ☆24 Updated 6 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy" ☆15 Updated 9 years ago
- Embed categorical variables via neural networks. ☆59 Updated 2 years ago
- A simple example of containerized data science with python and Docker. ☆51 Updated 7 years ago
- Word2Vec models with Twitter data using Spark. Blog: ☆65 Updated 6 years ago
- NLP pipeline using word2vec (preprocessing/embedding/prediction/clustering) ☆115 Updated last year
- Instructions & code for the EuroPython 2014 training session "Topic Modeling for Fun and Profit" ☆110 Updated 11 years ago
- Code base for representation learning of very short texts, such as tweets. By Cedric De Boom, IBCN, Ghent University, Belgium. ☆35 Updated 9 years ago
- Multidimensional data explorer and visualization tool. ☆56 Updated 8 years ago
- content discovery... IN 3D ☆49 Updated 8 years ago
- Natural Language Processing with Spark's MLlib ☆62 Updated 7 years ago
- Record Linkage ToolKit (Find and link entities) ☆110 Updated 2 years ago
- Scalable String Similarity Joins in Python ☆39 Updated last year
- My machine learning model for the See Click Predict Fix Kaggle competition ☆31 Updated 8 years ago
- lightweight python wrapper for vowpal wabbit ☆169 Updated 5 years ago
- Tools, wrappers, etc... for data science with a concentration on text processing ☆206 Updated 2 years ago
- feng - feature engineering for machine-learning champions ☆27 Updated 8 years ago
- Topic modeling with gensim and LDA ☆168 Updated 8 years ago
- Quora Kaggle Competition : Natural Language Processing using word2vec embeddings, scikit-learn and xgboost for training ☆18 Updated 6 years ago
- Twitter visualization experiment using various python-based technologies. ☆60 Updated 9 years ago
- This is where all of the IPython Notebooks will be kept from the blog ☆60 Updated 7 years ago