dacort / faker-cli
Command-line interface to quickly generate fake CSV and JSON data
☆76Updated last year
Alternatives and similar repositories for faker-cli
Users who are interested in faker-cli are comparing it to the repositories listed below.
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆219Updated 4 months ago
- ☆93Updated 7 months ago
- Schema modelling framework for decentralised domain-driven ownership of data.☆257Updated last year
- Sample code to collect Apache Iceberg metrics for table monitoring☆28Updated last year
- Quickstart for any service☆159Updated this week
- ☆80Updated 10 months ago
- Yet Another (Spark) ETL Framework☆21Updated last year
- A write-audit-publish implementation on a data lake without the JVM☆46Updated last year
- Enforce Best Practices for all your Airflow DAGs. ⭐☆104Updated last week
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆64Updated 3 years ago
- PDF DataSource for Apache Spark that reads PDF files directly into a DataFrame and can OCR them☆75Updated 4 months ago
- Python package for querying iceberg data through duckdb.☆70Updated last year
- Run, mock and test fake Snowflake databases locally.☆149Updated last week
- A DuckDB-powered command line interface for Snowflake security, governance, operations, and cost optimization.☆40Updated last year
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆165Updated 9 months ago
- Delta Lake Documentation☆49Updated last year
- Quick Guides from Dremio on Several topics☆74Updated 2 weeks ago
- Data Engineering Digest☆28Updated last year
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆115Updated 5 months ago
- A Python Library to support running data quality rules while the spark job is running⚡☆189Updated this week
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆70Updated 2 weeks ago
- A terraform module that deploys Dagster to AWS, using ECS.☆37Updated 2 years ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆88Updated 2 years ago
- Open Data Stack Platform: a collection of projects and pipelines built with open data stack tools for scalable, observable data platform…☆20Updated last month
- A Table format agnostic data sharing framework☆38Updated last year
- A bunch of hacks developed around dbt☆48Updated 5 years ago
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever…☆260Updated last month
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆61Updated 2 years ago
- An example dbt project using AutomateDV to create a Data Vault 2.0 Data Warehouse based on the Snowflake TPC-H dataset.☆52Updated last year
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated last year