Wittline / csv-schema-inferenceLinks
A tool to automatically infer columns data types in .csv files
☆37Updated 3 years ago
Alternatives and similar repositories for csv-schema-inference
Users that are interested in csv-schema-inference are comparing it to the libraries listed below
Sorting:
- Opinionated JSON to CSV/XLSX/SQLITE/PARQUET converter. Flattens JSON fast.☆205Updated 7 months ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆236Updated this week
- Demo Project for Open Source MDS☆170Updated 5 months ago
- dagster scikit-learn pipeline example.☆46Updated 2 years ago
- Set up a Cost-Effective Modern Data Stack for a Charity☆19Updated 10 months ago
- An example of a Dagster project with a possible folder structure to organize the assets, jobs, repositories, schedules, and ops. Also has…☆102Updated last year
- Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables☆75Updated 2 years ago
- ☆82Updated 4 months ago
- A curated list of dagster code snippets for data engineers☆56Updated last year
- Repo for orienting dbt users to the Dagster asset framework☆56Updated 3 years ago
- ☆81Updated 11 months ago
- ☆80Updated 2 years ago
- New generation opensource data stack☆76Updated 3 years ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily☆108Updated this week
- ☆158Updated 3 weeks ago
- Playground for using large language models into the Modern Data Stack for entity matching☆108Updated 2 years ago
- Data Tools Subjective List☆89Updated 2 years ago
- A playground for running duckdb as a stateless query engine over a data lake☆218Updated 2 years ago
- Contribute to dlt verified sources 🔥☆104Updated 2 months ago
- ☆178Updated 8 months ago
- One framework to develop, deploy and operate data workflows with Python and SQL.☆476Updated last week
- Cost Efficient Data Pipelines with DuckDB☆61Updated 8 months ago
- [DEPRECATED] A dbt adapter for Excel.☆96Updated 10 months ago
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset☆258Updated last month
- A simple and easy to use Data Quality (DQ) tool built with Python.☆51Updated 2 years ago
- Schema modelling framework for decentralised domain-driven ownership of data.☆261Updated 2 years ago
- Python+VueJS application to load, explore, combine,transform and deliver data☆102Updated 11 months ago
- ✨ Build dashboards with end-to-end version control. 🔋 CLI w/ batteries included, no infra required. Develop on your laptop for instant r…☆93Updated this week
- Sample project that use Dagster, dbt, DuckDB and Dash to visualize car and motorcycle Spanish market☆58Updated 3 years ago
- Example Dagster Cloud code for the Hooli Data Engineering organization.☆22Updated 3 weeks ago