black-tea / data-projectsLinks
A compendium of data projects and associated blog posts
☆10Updated 6 years ago
Alternatives and similar repositories for data-projects
Users that are interested in data-projects are comparing it to the libraries listed below
Sorting:
- Tutorial code and data for the entity resolution workshops.☆45Updated 10 years ago
- An example on how to train supervised classifiers for multi-label text classification using sklearn pipelines☆110Updated 7 years ago
- classify a job description (or noisy job title) into a ONET job title☆19Updated 9 years ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆41Updated 6 years ago
- Package that returns a company embedding given a company name☆47Updated 5 years ago
- Automatically labeling training data☆107Updated 6 years ago
- Clustering analysis of one million tweets using scikit-learn, including basic benchmarking of various clustering algorithms☆36Updated 9 years ago
- Extracting LinkedIn comments from any post and export it to Excel file☆23Updated 7 years ago
- Build a deep learning model for predicting the named entities from text.☆55Updated 7 years ago
- Transfer Learning for NLP Tasks☆55Updated 7 years ago
- Tutorial on deploying machine learning models to production☆59Updated 6 years ago
- Quora Kaggle Competition : Natural Language Processing using word2vec embeddings, scikit-learn and xgboost for training☆18Updated 6 years ago
- Using NLP to cluster reddit user comments by topics☆14Updated 8 years ago
- Topic Modelling for Humans☆22Updated 7 years ago
- Propensity models make true predictions about a customer’s future behavior. With propensity models you can truly anticipate a customer's …☆18Updated 6 years ago
- A simple Flask API for named entity extraction using spaCy Model☆47Updated 6 years ago
- Clinical NLP Analysis with Elasticsearch and Kibana☆35Updated 6 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle☆33Updated 9 years ago
- Data Processing and Machine learning methods for the Open Skills Project☆172Updated last year
- A collection of simple tutorials for using Fonduer☆100Updated 5 years ago
- Project files related to topic modeling of NYT articles regarding mental health☆17Updated 7 years ago
- Material for UW Extension Data Science 350☆19Updated 7 years ago
- This repository contains machine learning related work for the corpus to graph project, including Jupyter research notebooks and a Flask …☆46Updated 9 years ago
- ☆19Updated 6 years ago
- Code For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"☆138Updated 3 years ago
- A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.☆83Updated last year
- Multi-Label Text Classification with Transfer Learning☆18Updated 5 years ago
- Event extraction pipeline.☆34Updated 8 years ago
- Topic modelling on financial news with Natural Language Processing☆59Updated 8 years ago
- Experiments on how to use machine learning to rank a product catalog☆83Updated 8 years ago