Wittline / csv-schema-inference
A tool to automatically infer column data types in .csv files
☆35 Updated 2 years ago
Alternatives and similar repositories for csv-schema-inference
Users who are interested in csv-schema-inference are comparing it to the repositories listed below
- Opinionated JSON to CSV/XLSX/SQLITE/PARQUET converter. Flattens JSON fast. ☆198 Updated 2 months ago
- Data Tools Subjective List ☆86 Updated 2 years ago
- Demo Project for Open Source MDS ☆168 Updated last week
- A curated list of dagster code snippets for data engineers ☆57 Updated last year
- Department of Education (DOE) for New South Wales (AUS) data stack in a box ☆35 Updated 9 months ago
- Sample project that uses Dagster, dbt, DuckDB and Dash to visualize the Spanish car and motorcycle market ☆57 Updated 2 years ago
- dagster scikit-learn pipeline example. ☆46 Updated 2 years ago
- Repo for orienting dbt users to the Dagster asset framework ☆54 Updated 2 years ago
- ☆79 Updated 2 weeks ago
- Example Dagster Cloud code for the Hooli Data Engineering organization. ☆11 Updated last week
- Set up a Cost-Effective Modern Data Stack for a Charity ☆19 Updated 5 months ago
- Possibly the fastest DataFrame-agnostic quality check library in town. ☆202 Updated last week
- An example of a Dagster project with a possible folder structure to organize the assets, jobs, repositories, schedules, and ops. Also has… ☆97 Updated 10 months ago
- Schema modelling framework for decentralised domain-driven ownership of data. ☆257 Updated last year
- DAG based BI-as-code CLI tool. Unlocks a better approach to data visualization that integrates seamlessly into the modern data stack. ☆60 Updated last week
- A template repository with all the fundamentals needed to develop and deploy a Python data-processing routine for Prefect pipelines. ☆20 Updated 3 years ago
- A curated collection of helpful SQL queries and functions, maintained by Count. ☆205 Updated 3 years ago
- 📦 Serverless and local-first Open Data Platform ☆295 Updated 3 weeks ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases. ☆57 Updated 3 years ago
- One framework to develop, deploy and operate data workflows with Python and SQL. ☆458 Updated 2 weeks ago
- Anomstack - Painless open source anomaly detection for your metrics 📈📉🚀 ☆105 Updated last week
- Examples of various flow deployments for Prefect 1.0 (storage and run configurations) ☆35 Updated 3 years ago
- A simple and easy-to-use Data Quality (DQ) tool built with Python. ☆50 Updated last year
- Work with your web service, database, and streaming schemas in a single format. ☆344 Updated 2 months ago
- Playground for using large language models in the Modern Data Stack for entity matching ☆108 Updated 2 years ago
- ☆81 Updated 2 years ago
- Python+VueJS application to load, explore, combine, transform and deliver data ☆96 Updated 6 months ago
- A playground for running duckdb as a stateless query engine over a data lake ☆212 Updated last year
- Contribute to dlt verified sources 🔥 ☆92 Updated this week
- Cost Efficient Data Pipelines with DuckDB ☆57 Updated 3 months ago