Wittline / csv-schema-inference
A tool to automatically infer column data types in .csv files
☆35Updated 2 years ago
Alternatives and similar repositories for csv-schema-inference
Users that are interested in csv-schema-inference are comparing it to the libraries listed below
- Opinionated JSON to CSV/XLSX/SQLITE/PARQUET converter. Flattens JSON fast.☆198Updated 3 months ago
- dagster scikit-learn pipeline example.☆45Updated 2 years ago
- Set up a Cost-Effective Modern Data Stack for a Charity☆19Updated 6 months ago
- Demo Project for Open Source MDS☆168Updated last month
- 📦 Serverless and local-first Open Data Platform☆298Updated 2 months ago
- Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase☆131Updated 3 years ago
- Sample project that uses Dagster, dbt, DuckDB and Dash to visualize the Spanish car and motorcycle market☆58Updated 2 years ago
- Data Tools Subjective List☆86Updated 2 years ago
- Repo for orienting dbt users to the Dagster asset framework☆55Updated 2 years ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆220Updated this week
- New-generation open-source data stack☆73Updated 3 years ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.☆57Updated 3 years ago
- Write python locally, execute SQL in your data warehouse☆268Updated 3 years ago
- Example Dagster Cloud code for the Hooli Data Engineering organization.☆12Updated last week
- Work with your web service, database, and streaming schemas in a single format.☆343Updated last month
- Jupyter Cell / Line Magics for DuckDB☆54Updated last week
- A curated list of dagster code snippets for data engineers☆56Updated last year
- A curated collection of helpful SQL queries and functions, maintained by Count.☆206Updated 3 years ago
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- A playground for running duckdb as a stateless query engine over a data lake☆211Updated last year
- Schema modelling framework for decentralised domain-driven ownership of data.☆259Updated last year
- Anomstack - Painless open source anomaly detection for your metrics 📈📉🚀☆105Updated last week
- ☆116Updated 2 years ago
- Create and manage data pipes with Meerschaum.☆153Updated 2 weeks ago
- 🏃♀️ Minimalist SQL orchestrator☆262Updated this week
- A template repository with all the fundamentals needed to develop and deploy a Python data-processing routine for Prefect pipelines.☆20Updated 3 years ago
- Python+VueJS application to load, explore, combine, transform and deliver data☆97Updated 7 months ago
- Playground for using large language models in the Modern Data Stack for entity matching☆108Updated 2 years ago
- An example of a Dagster project with a possible folder structure to organize the assets, jobs, repositories, schedules, and ops. Also has…☆99Updated 11 months ago
- Weekly Data Engineering Newsletter☆96Updated last year