eyaltrabelsi / my-notebooks
☆14 · Updated 2 years ago
Alternatives and similar repositories for my-notebooks
Users who are interested in my-notebooks are comparing it to the libraries listed below.
- Projects developed by Domino's R&D team ☆78 · Updated 3 years ago
- Deep Learning how-to's using Lance file format ☆19 · Updated 2 months ago
- Record matching and entity resolution at scale in Spark ☆35 · Updated last year
- Using Kafka-Python to illustrate a ML production pipeline ☆112 · Updated 2 years ago
- Tutorial code and data for the entity resolution workshops. ☆45 · Updated 10 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossmann Sales data from Kaggle ☆33 · Updated 9 years ago
- Match schema attributes of relational databases by value similarity. As a study assignment, this isn't well documented, but you can conta… ☆24 · Updated 5 years ago
- Python automatic data quality check toolkit ☆282 · Updated 4 years ago
- Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops ☆118 · Updated 2 years ago
- Content for the Model Interpretability Tutorial at Pycon US 2019 ☆41 · Updated last year
- MLOps simplified. One-stop AI delivery platform, all the features you need. ☆100 · Updated this week
- Makes Interactive Chart Widget, Cleans raw data, Runs baseline models, Interactive hyperparameter tuning & tracking ☆55 · Updated 3 years ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K … ☆83 · Updated 7 months ago
- Code showing how to use a model based on the ML model base class. ☆10 · Updated 2 years ago
- ☆19 · Updated 4 years ago
- An open source python library for automated prediction engineering ☆45 · Updated 2 months ago
- Locality Sensitive Hashing for semantic similarity (Python 3.x) ☆15 · Updated 7 years ago
- A simplified version of featuretools for Spark ☆31 · Updated 6 years ago
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames. ☆103 · Updated 6 years ago
- Data Analysis Baseline Library ☆133 · Updated 9 months ago
- MinHash implementation in Python ☆11 · Updated 11 months ago
- Blog post on ETL pipelines with Airflow ☆23 · Updated 5 years ago
- A collaborative feature engineering system built on JupyterHub ☆94 · Updated 6 years ago
- Python Machine Learning (ML) project that demonstrates the archetypal ML workflow within a Jupyter notebook, with automated model deploym… ☆62 · Updated 2 years ago
- Asynchronous actions for PySpark ☆47 · Updated 3 years ago
- Code supporting Data Science articles at The Marketing Technologist, Floryn Tech Blog, and Pythom.nl ☆71 · Updated 2 years ago
- ☆18 · Updated 3 years ago
- ☆42 · Updated 6 months ago
- Learn the PySpark API through pictures and simple examples ☆170 · Updated 4 years ago
- A data labelling tool based on Streamlit. ☆23 · Updated 3 years ago