tdubon / study_notes
Repo contains Jupyter notebooks compiled during my review of the programming books listed.
☆13Updated 3 years ago
Alternatives and similar repositories for study_notes:
Users that are interested in study_notes are comparing it to the libraries listed below
- A comprehensive tool for linguistic analysis of communities☆49Updated 3 years ago
- ☆16Updated 4 years ago
- Repository for my master thesis on automated string handling☆16Updated 3 years ago
- A utility for labeling clusters of text data.☆28Updated 3 years ago
- Bag of, not words, but tricks!☆68Updated last year
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟☆53Updated 3 years ago
- Notebooks configured to be run with Binder, usually found on my blog.☆42Updated 2 years ago
- Python package for deduplication/entity resolution using active learning☆76Updated 7 months ago
- Just another sentiment wrapper.☆17Updated 3 years ago
- Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)☆30Updated 2 months ago
- Record matching and entity resolution at scale in Spark☆34Updated last year
- Techniques & resources for training interpretable ML models, explaining ML models, and debugging ML models.☆21Updated 2 years ago
- Clean personally identifiable information from dirty dirty text using spaCy.☆41Updated last year
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Updated 2 years ago
- Comparing Polars to Pandas and a small introduction☆43Updated 3 years ago
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning☆41Updated 4 years ago
- Companion Repo for the book The Applied ML Field Manual, Prithiviraj Damodaran☆12Updated 2 years ago
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently.☆36Updated last year
- Low-code pre-built pipelines for experiments with huggingface/transformers for Data Scientists in a rush.☆16Updated 4 years ago
- ☆30Updated 2 years ago
- The ntentional blog - a machine learning journey☆23Updated 2 years ago
- NERtwork is a collection of scripts to help you create a network graph of co-occurring named entities using open source tools. This is do…☆48Updated last year
- ☆18Updated 2 years ago
- Tutorial for Topic Modelling using PySpark and Spark NLP☆17Updated 4 years ago
- XAI based human-in-the-loop framework for automatic rule-learning.☆48Updated 8 months ago
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included☆28Updated 3 years ago
- 🐍 Material for PyData Global 2021 Presentation: Effective Testing for Machine Learning Projects☆81Updated 3 years ago
- ☆19Updated 4 years ago
- Explainable Zero-Shot Topic Extraction☆62Updated 7 months ago