black-tea / data-projects
A compendium of data projects and associated blog posts
☆10 · Updated 6 years ago
Alternatives and similar repositories for data-projects
Users who are interested in data-projects are comparing it to the libraries listed below
- An example of how to train supervised classifiers for multi-label text classification using sklearn pipelines ☆110 · Updated 7 years ago
- Build a deep learning model for predicting the named entities from text. ☆55 · Updated 7 years ago
- Topic Modelling for Humans ☆22 · Updated 7 years ago
- Clustering analysis of one million tweets using scikit-learn, including basic benchmarking of various clustering algorithms ☆36 · Updated 9 years ago
- Classify a job description (or noisy job title) into a ONET job title ☆19 · Updated 9 years ago
- Material for UW Extension Data Science 350 ☆19 · Updated 8 years ago
- Automatically labeling training data ☆107 · Updated 7 years ago
- Watson OpenScale tutorials including sample models, notebooks and applications ☆22 · Updated 3 years ago
- Transfer Learning for NLP Tasks ☆55 · Updated 7 years ago
- Tutorial on deploying machine learning models to production ☆59 · Updated 6 years ago
- Tutorial code and data for the entity resolution workshops. ☆45 · Updated 10 years ago
- An evaluation of word-embeddings for classification ☆32 · Updated 6 years ago
- RESTful API hosting xgboost model ☆25 · Updated 8 years ago
- This repo contains my hackathon solutions ☆39 · Updated 3 years ago
- This repository contains machine learning related work for the corpus to graph project, including Jupyter research notebooks and a Flask … ☆46 · Updated 9 years ago
- ☆16 · Updated 5 years ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz… ☆41 · Updated 6 years ago
- Quora Kaggle Competition: Natural Language Processing using word2vec embeddings, scikit-learn and xgboost for training ☆18 · Updated 7 years ago
- Code For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models" ☆139 · Updated 3 years ago
- A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python. ☆83 · Updated last year
- Embed categorical variables via neural networks. ☆59 · Updated 2 years ago
- In-Session Personalization Workshop for eCommerce, April 2021, and the MICES Workshop in June 2021. ☆23 · Updated 4 years ago
- Sample data science projects (machine learning, optimization, business intelligence) ☆28 · Updated 7 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossmann Sales data from Kaggle ☆33 · Updated 9 years ago
- Spark NLP for Streamlit ☆15 · Updated 4 years ago
- NLP tutorial ☆42 · Updated 7 years ago
- Text summarization algorithm for the Capstone Project at Springboard code bootcamp ☆54 · Updated 3 years ago
- Slides and Code Tutorials for Strata Data 2018 Tutorial on Deep Learning Methodologies for Natural Language Processing ☆22 · Updated 7 years ago
- Multi-Label Text Classification with Transfer Learning ☆18 · Updated 5 years ago
- Discovers similarity between scientific papers ☆62 · Updated 10 years ago