mapattacker / datascience
How to be a Data Scientist
☆34 · Updated 2 years ago
Alternatives and similar repositories for datascience:
Users who are interested in datascience are comparing it to the repositories listed below
- Various methods for generating synthetic data for data science and ML ☆76 · Updated 3 years ago
- Using Kafka-Python to illustrate a ML production pipeline ☆108 · Updated 2 years ago
- Powerful rapid automatic EDA and feature engineering library with a very easy-to-use API ☆53 · Updated 3 years ago
- Python Machine Learning (ML) project that demonstrates the archetypal ML workflow within a Jupyter notebook, with automated model deploym… ☆60 · Updated last year
- CentOS-based Docker container for Time Series Analysis and Modeling. ☆21 · Updated 5 years ago
- ☆21 · Updated last year
- Public code & notebooks accompanying our blog posts & YouTube tutorials (https://www.youtube.com/c/PyMCLabs) ☆24 · Updated last month
- Predicting the Likelihood to Purchase a Financial Product Following a Direct Marketing Campaign ☆27 · Updated 2 years ago
- Best practices for engineering ML pipelines. ☆37 · Updated 2 years ago
- Companion Notebooks and Data for Data Science with Python and Dask from Manning Publications ☆51 · Updated 4 years ago
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library ☆51 · Updated 2 years ago
- Jupyter Notebooks and other material from tutorial sessions on Machine Learning, Data Science, and related ☆56 · Updated 3 years ago
- Fast Bayesian A/B and Multivariate testing. ☆36 · Updated 2 years ago
- A fast numpy-based implementation of ranking metrics for information retrieval and recommendation. ☆32 · Updated 2 years ago
- Library for Multi-objective optimization in Gradient Boosted Trees ☆78 · Updated 5 months ago
- NitroFE is a Python feature engineering engine which provides a variety of modules designed to internally save past dependent values for … ☆106 · Updated 2 years ago
- Pre-modelling analysis of data through exploratory data analysis and statistical tests. ☆50 · Updated last year
- Big Data's open seminars: An Interactive Introduction to Reinforcement Learning ☆15 · Updated 7 years ago
- Bayesian Hierarchical Models at Scale ☆52 · Updated 3 years ago
- Deployment tool for online machine learning models ☆97 · Updated 2 years ago
- Scikit-Learn compatible transformer that turns categorical variables into dense entity embeddings. ☆42 · Updated last year
- Pytest for Data Science Beginners ☆58 · Updated 6 years ago
- An unsupervised feature selection technique using supervised algorithms such as XGBoost ☆89 · Updated last year
- Similarity encoding of dirty categorical variables (strings) ☆20 · Updated 5 years ago
- Code for blog post on Bayesian inference in PyStan ☆20 · Updated 3 years ago
- A scikit-learn compatible estimator based on business rules, with an interactive dashboard included ☆28 · Updated 3 years ago
- General Interpretability Package ☆58 · Updated 2 years ago
- datascienv is a package that helps you set up your environment in a single line of code with all dependencies; it also includes pyforest… ☆58 · Updated 3 years ago
- Production Machine Learning Pipeline for Text Classification with fastText ☆32 · Updated 3 years ago
- Using a feature store to connect the DataOps and MLOps workflows to enable collaborative teams to develop efficiently. ☆55 · Updated 2 years ago