Wittline / csv-schema-inference
A tool to automatically infer column data types in .csv files
☆35 Updated last year
Alternatives and similar repositories for csv-schema-inference:
Users who are interested in csv-schema-inference compare it to the libraries listed below.
- dagster scikit-learn pipeline example. ☆44 Updated last year
- Set up a Cost-Effective Modern Data Stack for a Charity ☆19 Updated 10 months ago
- ☆73 Updated this week
- Repo for orienting dbt users to the Dagster asset framework ☆51 Updated 2 years ago
- New-generation open-source data stack ☆65 Updated 2 years ago
- A curated list of dagster code snippets for data engineers ☆53 Updated 10 months ago
- Sample project that uses Dagster, dbt, DuckDB and Dash to visualize the Spanish car and motorcycle market ☆54 Updated 2 years ago
- [DEPRECATED] A dbt adapter for Excel. ☆91 Updated last year
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/ ☆12 Updated 7 months ago
- Make dbt great again! Enables end users to extend dbt to their needs ☆42 Updated last month
- Cost Efficient Data Pipelines with DuckDB ☆48 Updated 5 months ago
- Department of Education (DOE) for New South Wales (AUS) data stack in a box ☆30 Updated 2 months ago
- A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in … ☆21 Updated 2 years ago
- A cool simple example of functional data engineering ☆33 Updated last year
- An example of a Dagster project with a possible folder structure to organize the assets, jobs, repositories, schedules, and ops. Also has… ☆90 Updated 2 months ago
- Python wrapper for the Sling CLI tool ☆44 Updated 3 months ago
- The shared semantic layer definitions that dbt-core and MetricFlow use. ☆75 Updated last month
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases. ☆57 Updated 2 years ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html ☆61 Updated 2 years ago
- A serverless DuckDB deployment on GCP ☆38 Updated 2 years ago
- The go-to demo for public and private dbt Learn ☆73 Updated 4 months ago
- ☆32 Updated 4 years ago
- ☆19 Updated 3 years ago
- dbt-core-interface is an MIT licensed high level wrapper for dbt-core that can be used to drive third party integrations such as servers,… ☆31 Updated last year
- Sample configuration to deploy a modern data platform. ☆87 Updated 3 years ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects ☆70 Updated last year
- ☆80 Updated this week
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset ☆46 Updated 2 months ago
- Schema modelling framework for decentralised domain-driven ownership of data. ☆249 Updated last year
- Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables ☆72 Updated last year