black-tea / data-projects
A compendium of data projects and associated blog posts
☆10 Updated 6 years ago
Alternatives and similar repositories for data-projects
Users that are interested in data-projects are comparing it to the libraries listed below
- An example of how to train supervised classifiers for multi-label text classification using sklearn pipelines ☆110 Updated 7 years ago
- Topic Modelling for Humans ☆22 Updated 7 years ago
- Tutorial code and data for the entity resolution workshops. ☆45 Updated 10 years ago
- Automatically labeling training data ☆107 Updated 6 years ago
- Text summarization algorithm for the Capstone Project at Springboard code bootcamp ☆54 Updated 2 years ago
- Long(er) text representation and classification using Doc2Vec embeddings ☆109 Updated last year
- Transfer Learning for NLP Tasks ☆55 Updated 7 years ago
- Movie recommendations based on user-written passages about preferred movies. ☆16 Updated 6 years ago
- Python library for advanced text mining ☆69 Updated 5 years ago
- Embed categorical variables via neural networks. ☆59 Updated 2 years ago
- Extract LinkedIn comments from any post and export them to an Excel file ☆23 Updated 7 years ago
- A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python. ☆83 Updated last year
- Clustering analysis of one million tweets using scikit-learn, including basic benchmarking of various clustering algorithms ☆36 Updated 9 years ago
- Classify a job description (or noisy job title) into an ONET job title ☆19 Updated 9 years ago
- Watson OpenScale tutorials including sample models, notebooks and applications ☆22 Updated 3 years ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz… ☆41 Updated 6 years ago
- Document clustering in Python ☆30 Updated 9 years ago
- A simple Flask API for named entity extraction using spaCy Model ☆47 Updated 6 years ago
- An evaluation of word-embeddings for classification ☆32 Updated 6 years ago
- Package that returns a company embedding given a company name ☆49 Updated 5 years ago
- Material for UW Extension Data Science 350 ☆19 Updated 7 years ago
- ☆16 Updated 4 years ago
- A collection of simple tutorials for using Fonduer ☆100 Updated 5 years ago
- KDD Hands-On Tutorial (2018) ☆29 Updated 3 years ago
- ☆19 Updated 6 years ago
- Text Similarity Search Application using Modern NLP and Elasticsearch ☆30 Updated 5 years ago
- Tutorial on deploying machine learning models to production ☆59 Updated 6 years ago
- ☆46 Updated 4 years ago
- In-Session Personalization Workshop for eCommerce, April 2021, and the MICES Workshop in June 2021. ☆22 Updated 4 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle ☆33 Updated 9 years ago