black-tea / data-projectsLinks
A compendium of data projects and associated blog posts 
☆10Updated 5 years ago
Alternatives and similar repositories for data-projects
Users that are interested in data-projects are comparing it to the libraries listed below
Sorting:
- Package that returns a company embedding given a company name☆47Updated 5 years ago
 - Tutorial code and data for the entity resolution workshops.☆45Updated 10 years ago
 - classify a job description (or noisy job title) into a ONET job title☆19Updated 9 years ago
 - Clustering analysis of one million tweets using scikit-learn, including basic benchmarking of various clustering algorithms☆36Updated 9 years ago
 - Automatically labeling training data☆107Updated 6 years ago
 - A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u…☆27Updated 6 years ago
 - Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆41Updated 6 years ago
 - Quora Kaggle Competition : Natural Language Processing using word2vec embeddings, scikit-learn and xgboost for training☆18Updated 6 years ago
 - Build a deep learning model for predicting the named entities from text.☆55Updated 7 years ago
 - ☆16Updated 4 years ago
 - A simple Flask API for named entity extraction using spaCy Model☆47Updated 6 years ago
 - An evaluation of word-embeddings for classification☆32Updated 6 years ago
 - Probabilistic/machine-learning algorithms for medical record linkage [Critical Juncture]☆14Updated 8 years ago
 - An example on how to train supervised classifiers for multi-label text classification using sklearn pipelines☆110Updated 7 years ago
 - Propensity models make true predictions about a customer’s future behavior. With propensity models you can truly anticipate a customer's …☆18Updated 6 years ago
 - This repository contains machine learning related work for the corpus to graph project, including Jupyter research notebooks and a Flask …☆46Updated 9 years ago
 - Data Processing and Machine learning methods for the Open Skills Project☆172Updated 11 months ago
 - Augment IBM Watson Natural Language Understanding APIs with a configurable mechanism for text classification, uses Watson Studio.☆46Updated 6 years ago
 - A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.☆83Updated last year
 - A collection of simple tutorials for using Fonduer☆100Updated 5 years ago
 - Topic Modelling for Humans☆22Updated 7 years ago
 - Guide on creating an API for serving your ML model☆67Updated 3 years ago
 - Transfer Learning for NLP Tasks☆55Updated 6 years ago
 - MLinProduction SageMaker workshop hosted in April 2020☆15Updated 5 years ago
 - NERtwork is a collection of scripts to help you create a network graph of co-occurring named entities using open source tools. This is do…☆49Updated last year
 - Tutorial for Topic Modelling using PySpark and Spark NLP☆17Updated 5 years ago
 - Embed categorical variables via neural networks.☆59Updated 2 years ago
 - store my personal project☆22Updated 5 years ago
 - Text Similarity Search Application using Modern NLP and Elasticsearch☆30Updated 5 years ago
 - Facebook's fasttext tech☆15Updated 8 years ago