cnatsis / faker-clickstreamLinks
Clickstream Faker Provider for Python.
☆11Updated 3 years ago
Alternatives and similar repositories for faker-clickstream
Users that are interested in faker-clickstream are comparing it to the libraries listed below
Sorting:
- PySpark phonetic and string matching algorithms☆41Updated last year
- Data validation library for PySpark 3.0.0☆33Updated 3 years ago
- A curated list of awesome blogs, videos, tools and resources about Data Contracts☆182Updated last year
- Weekly Data Engineering Newsletter☆96Updated last year
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆28Updated 3 years ago
- ☆23Updated 4 years ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆63Updated 3 years ago
- A write-audit-publish implementation on a data lake without the JVM☆45Updated last year
- Snowflake Guide: Building a Recommendation Engine Using Snowflake & Amazon SageMaker☆32Updated 4 years ago
- Data Tools Subjective List☆89Updated 2 years ago
- An LLM-powered chatbot with the added context of the dbt knowledge base.☆39Updated last year
- A repository of sample code to show data quality checking best practices using Airflow.☆78Updated 2 years ago
- Code for dbt tutorial☆168Updated 5 months ago
- Playground for using large language models into the Modern Data Stack for entity matching☆108Updated 2 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆115Updated this week
- PipeRider dbt workshop for DataTalksClub DE Zoomcamp☆18Updated 2 years ago
- Delta Lake examples☆238Updated last year
- Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team …☆131Updated last week
- Delta Lake Documentation☆53Updated last year
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆91Updated 2 years ago
- Airflow training for the crunch conf☆105Updated 7 years ago
- Project files for the post: Running PySpark Applications on Amazon EMR: Methods for Interacting with PySpark on Amazon Elastic MapReduce.☆38Updated 3 years ago
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner☆71Updated 4 years ago
- Sample configuration to deploy a modern data platform.☆89Updated 4 years ago
- Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase☆132Updated 3 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated last week
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆168Updated 2 years ago
- Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work☆47Updated 3 years ago
- Great Expectations Airflow operator☆170Updated 2 weeks ago
- Schema modelling framework for decentralised domain-driven ownership of data.☆261Updated 2 years ago