StanfordDataScience / best-practices
Source files for the Open, Transparent, and Reproducible Data Science Handbook
☆49 Updated last year
Alternatives and similar repositories for best-practices:
Users who are interested in best-practices are comparing it to the libraries listed below:
- A magic-free, understandable python project template using tox, pytest, ruff and pip-tools. ☆34 Updated 3 weeks ago
- ☆58 Updated last year
- convert a scikit-learn decision tree into a Keras model ☆39 Updated last year
- Flenser is a simple, minimal, automated exploratory data analysis tool. ☆78 Updated last week
- Have UV deal with all your Jupyter deps. ☆24 Updated 8 months ago
- ☆14 Updated last year
- CLI for running files through AWS Textract ☆54 Updated last year
- SQL functions for calling OpenAI APIs ☆21 Updated 2 years ago
- A library to use `modal` as a backend for `joblib`. ☆28 Updated 3 months ago
- Comparing Polars to Pandas and a small introduction ☆43 Updated 3 years ago
- Create a SQLite database containing your data from Google Calendar ☆58 Updated 2 years ago
- Python package for extractive NLP using the OpenAI API ☆17 Updated 8 months ago
- The Awesome Panel CLI makes it super simple to develop high-quality data apps with Panel 💪 ☆20 Updated 2 years ago
- Scrollership through 20m pubmed abstracts. ☆26 Updated last year
- ☆19 Updated 4 years ago
- A fork of sqlite-utils with CLI etc removed ☆15 Updated last month
- Potnia is an open-source Python library designed to convert Romanized transliterations of ancient texts into Unicode representations of t… ☆17 Updated last week
- cookiecutter template for setting up Sphinx docs with Markdown support ☆12 Updated 7 months ago
- Adding Marimo to Datasette ☆20 Updated last month
- Semantic Unit Testing ☆15 Updated 3 weeks ago
- A Datasette plugin for making data visualizations with Observable Plot ☆21 Updated last year
- Datasette plugin for visualizing data using Vega ☆58 Updated last year
- A Python package for computing effect sizes ☆22 Updated 2 years ago
- The LLM plugins directory ☆42 Updated last year
- A maximum-strength name parser for record linkage. ☆37 Updated this week
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models ☆16 Updated last month
- 🐾 PdpCLI is a pandas DataFrame processing CLI tool which enables you to build a pandas pipeline from a configuration file. ☆15 Updated last year
- A library to create lore plots (logistic regression of the prevalence of a categorical variable as a function of a continuous feature) ☆16 Updated last week
- Doing sql in notebooks. ☆15 Updated last year
- Scripts and ideas to manage tons and tons of images and movies ☆17 Updated 2 months ago