Wittline / csv-schema-inferenceLinks
A tool to automatically infer columns data types in .csv files
☆37Updated 2 years ago
Alternatives and similar repositories for csv-schema-inference
Users that are interested in csv-schema-inference are comparing it to the libraries listed below
Sorting:
- Opinionated JSON to CSV/XLSX/SQLITE/PARQUET converter. Flattens JSON fast.☆204Updated 5 months ago
- Repo for orienting dbt users to the Dagster asset framework☆56Updated 3 years ago
- dagster scikit-learn pipeline example.☆46Updated 2 years ago
- Set up a Cost-Effective Modern Data Stack for a Charity☆19Updated 8 months ago
- An example of a Dagster project with a possible folder structure to organize the assets, jobs, repositories, schedules, and ops. Also has…☆101Updated last year
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆176Updated 3 weeks ago
- One framework to develop, deploy and operate data workflows with Python and SQL.☆472Updated last week
- ☆74Updated last year
- A curated list of dagster code snippets for data engineers☆56Updated last year
- Demo Project for Open Source MDS☆168Updated 3 months ago
- ☆81Updated 3 months ago
- Create and manage data pipes with Meerschaum.☆153Updated last week
- Flatten/Explode JSON objects☆21Updated 6 months ago
- A playground for running duckdb as a stateless query engine over a data lake☆216Updated last year
- Building 3D Trusted Data Pipelines With Dagster, Dbt, and Duckdb☆24Updated 2 years ago
- 📦 Serverless and local-first Open Data Platform☆302Updated last week
- A template repository with all the fundamentals needed to develop and deploy a Python data-processing routine for Prefect pipelines.☆20Updated 3 years ago
- Sample project that use Dagster, dbt, DuckDB and Dash to visualize car and motorcycle Spanish market☆58Updated 3 years ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆229Updated last month
- New generation opensource data stack☆76Updated 3 years ago
- Data Tools Subjective List☆88Updated 2 years ago
- Contribute to dlt verified sources 🔥☆101Updated this week
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated 2 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆114Updated last month
- A "modern" Strava data pipeline fueled by dlt, duckdb, dbt, and evidence.dev☆37Updated 7 months ago
- Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase☆132Updated 3 years ago
- Make dbt docs and Apache Superset talk to one another☆154Updated 2 months ago
- Anomstack - Painless open source anomaly detection for your metrics 📈📉🚀☆106Updated last month
- Project demonstrating how to automate Prefect 2.0 deployments to AWS ECS Fargate☆116Updated 2 years ago
- Example Dagster Cloud code for the Hooli Data Engineering organization.☆17Updated last month