black-tea / data-projectsLinks
A compendium of data projects and associated blog posts
☆10Updated 5 years ago
Alternatives and similar repositories for data-projects
Users that are interested in data-projects are comparing it to the libraries listed below
Sorting:
- classify a job description (or noisy job title) into a ONET job title☆19Updated 9 years ago
- Tutorial code and data for the entity resolution workshops.☆45Updated 10 years ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆41Updated 5 years ago
- Package that returns a company embedding given a company name☆47Updated 5 years ago
- ☆16Updated 4 years ago
- An example on how to train supervised classifiers for multi-label text classification using sklearn pipelines☆110Updated 7 years ago
- Topic Modelling for Humans☆22Updated 7 years ago
- Embed categorical variables via neural networks.☆59Updated 2 years ago
- Clustering analysis of one million tweets using scikit-learn, including basic benchmarking of various clustering algorithms☆36Updated 9 years ago
- An evaluation of word-embeddings for classification☆32Updated 6 years ago
- demo using FuzzyWuzzy matching company names☆75Updated 3 years ago
- Build a deep learning model for predicting the named entities from text.☆56Updated 7 years ago
- Text Preprocessing in Python☆19Updated 8 years ago
- Propensity models make true predictions about a customer’s future behavior. With propensity models you can truly anticipate a customer's …☆17Updated 6 years ago
- This repository contains machine learning related work for the corpus to graph project, including Jupyter research notebooks and a Flask …☆46Updated 9 years ago
- Production Machine Learning Pipeline for Text Classification with fastText☆33Updated 4 years ago
- No Regrets: A deep dive comparison of bandits and A/B testing☆47Updated 7 years ago
- Automatically labeling training data☆107Updated 6 years ago
- Watson OpenScale tutorials including sample models, notebooks and applications☆22Updated 2 years ago
- KDD Hands-On Tutorial (2018)☆29Updated 2 years ago
- A simple Flask API for named entity extraction using spaCy Model☆47Updated 6 years ago
- store my personal project☆22Updated 5 years ago
- A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.☆83Updated last year
- Tutorial for Topic Modelling using PySpark and Spark NLP☆17Updated 5 years ago
- Tutorial on deploying machine learning models to production☆59Updated 5 years ago
- A previous version of Snorkel focused on information extraction☆35Updated 6 years ago
- This repo contains my hackathon solutions☆38Updated 3 years ago
- A collection of simple tutorials for using Fonduer☆100Updated 4 years ago
- Record Linkage ToolKit (Find and link entities)☆109Updated 2 years ago
- Guide on creating an API for serving your ML model☆67Updated 3 years ago