black-tea / data-projects
A compendium of data projects and associated blog posts
☆10Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for data-projects
- Tutorial for Topic Modelling using PySpark and Spark NLP☆16Updated 4 years ago
- store my personal project☆22Updated 4 years ago
- Project template for highly effective data science workflows☆29Updated 7 months ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆40Updated 5 years ago
- Topic Modelling for Humans☆22Updated 6 years ago
- A Spark Streaming implementation for Online Twitter Sentiment Analysis.☆8Updated 6 years ago
- Spark NLP for Streamlit☆15Updated 3 years ago
- ☆10Updated 3 years ago
- ☆16Updated 3 years ago
- Probabilistic/machine-learning algorithms for medical record linkage [Critical Juncture]☆14Updated 7 years ago
- Webscikit is a set of tools to run a webserver as a JSON Webservice for scikit-learn predictions. It comes with two examples: boston and …☆9Updated 6 years ago
- An evaluation of word-embeddings for classification☆33Updated 5 years ago
- Package that returns a company embedding given a company name☆42Updated 4 years ago
- classify a job description (or noisy job title) into a ONET job title☆17Updated 8 years ago
- 🦖 Streamlined Recommender Systems with TensorFlow and KubeFlow☆18Updated last year
- Recommender Systems, Social Network Analysis, static & dynamic graph modeling, Neo4j, igraph, networkX☆8Updated 7 years ago
- Watson OpenScale tutorials including sample models, notebooks and applications☆21Updated last year
- Tutorial code and data for the entity resolution workshops.☆45Updated 9 years ago
- Instant search for and access to many datasets in Pyspark.☆34Updated 2 years ago
- ☆37Updated 8 years ago
- An analysis of traffic accident data for the UK in 2014, using data from the UK Data Service. (Sourced from Kaggle with original data com…☆12Updated 6 years ago
- Money Laundering Detector is to prove the hypothesis that a solution powered by Machine Learning and Behaviour Analytics will find… -> cu…☆21Updated 6 years ago
- Topic models (just LDA for now) on the Hacker News corpus☆22Updated 9 years ago
- Repository for medium article☆22Updated 9 months ago
- Customer life time analysis (CLV analysis). We are using Gamma-Gamma model to estimate average transaction value for each customer.☆45Updated 6 years ago
- Live Twitter sentiment analysis using Python, Apache Spark Streaming, Kafka, NLTK, SocketIO☆20Updated 7 years ago
- A curated list of references for MLOps☆13Updated 4 years ago
- Follow the Lumiata Tech Blog on Medium!☆21Updated last year
- Skill Representations in Vector Space☆34Updated 10 months ago
- Bots for reviewing the credibility of web content: articles, tweets, sentences and websites☆9Updated last year