Nike-Inc / knockoff-factoryLinks
A library for generating fake data and populating database tables.
☆35Updated last year
Alternatives and similar repositories for knockoff-factory
Users that are interested in knockoff-factory are comparing it to the libraries listed below
Sorting:
- ∞ Priceloop Engineering Conventions for Scala, Python, Git Workflow etc☆100Updated 2 years ago
- ☆89Updated 2 years ago
- 🧪 Simple data science experimentation & tracking with jupyter, papermill, and mlflow.☆183Updated last year
- Capturing model drift and handling its response - Example webinar☆108Updated 6 years ago
- ML pipeline orchestration and model deployments on Kubernetes.☆434Updated 2 years ago
- Compare MLOps Platforms. Breakdowns of SageMaker, VertexAI, AzureML, Dataiku, Databricks, h2o, kubeflow, mlflow...☆392Updated 2 years ago
- Build and deploy a serverless data pipeline on AWS with no effort.☆111Updated 2 years ago
- HandySpark - bringing pandas-like capabilities to Spark dataframes☆196Updated 6 years ago
- A command line tool to easily add an ethics checklist to your data science projects.☆301Updated 3 weeks ago
- 🛠 Python project template with unit tests, code coverage, linting, type checking, Makefile wrapper, and GitHub Actions.☆149Updated last year
- Repository with sample code and instructions for "Continuous Intelligence" and "Continuous Delivery for Machine Learning: CD4ML" workshop…☆320Updated last year
- Monitor the stability of a Pandas or Spark dataframe ⚙︎☆505Updated last month
- Example repo to kickstart integration with mlflow pipelines.☆77Updated 2 years ago
- The Data Science Lifecycle Process is a process for taking data science teams from Idea to Value repeatedly and sustainably. The process …☆520Updated 4 years ago
- Template repository for data science lifecycle project☆197Updated 5 years ago
- A workshop with several modules to help learn Feast, an open-source feature store☆92Updated 3 months ago
- The LinkedIn Fairness Toolkit (LiFT) is a Scala/Spark library that enables the measurement of fairness in large scale machine learning wo…☆171Updated 2 years ago
- Repository with sample code and instructions for "Continuous Intelligence" and "Continuous Delivery for Machine Learning: CD4ML" workshop…☆143Updated last year
- Keep your local python scripts installed and in sync with a databricks notebook. Shortens the feedback loop to develop projects using a h…☆16Updated 3 months ago
- Write python locally, execute SQL in your data warehouse☆268Updated 3 years ago
- ☆42Updated 5 years ago
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner☆71Updated 3 years ago
- PySpark phonetic and string matching algorithms☆39Updated last year
- Public repository for the Search with Machine Learning course taught by Daniel Tunkelang and Grant Ingersoll. Available at https://coris…☆58Updated last year
- Create HTML profiling reports from Apache Spark DataFrames☆198Updated 5 years ago
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.☆81Updated last year
- ⚓ Eurybia monitors model drift over time and securizes model deployment with data validation☆214Updated 11 months ago
- A hands-on tutorial showing how to use Python to do anonymisation with synthetic data☆79Updated 3 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆169Updated last year
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.☆82Updated 3 weeks ago