Wittline / csv-schema-inferenceLinks
A tool to automatically infer columns data types in .csv files
☆36Updated 2 years ago
Alternatives and similar repositories for csv-schema-inference
Users that are interested in csv-schema-inference are comparing it to the libraries listed below
Sorting:
- Opinionated JSON to CSV/XLSX/SQLITE/PARQUET converter. Flattens JSON fast.☆200Updated 4 months ago
- dagster scikit-learn pipeline example.☆46Updated 2 years ago
- Demo Project for Open Source MDS☆167Updated 2 months ago
- A curated list of dagster code snippets for data engineers☆56Updated last year
- Possibly the fastest DataFrame-agnostic quality check library in town.☆223Updated this week
- Data Tools Subjective List☆86Updated 2 years ago
- An example of a Dagster project with a possible folder structure to organize the assets, jobs, repositories, schedules, and ops. Also has…☆100Updated 11 months ago
- ☆81Updated 8 months ago
- ☆74Updated last year
- New generation opensource data stack☆74Updated 3 years ago
- Flatten/Explode JSON objects☆20Updated 5 months ago
- Anomstack - Painless open source anomaly detection for your metrics 📈📉🚀☆106Updated last month
- quadipy is a python package to help transform structured data into RDF graph format☆19Updated 2 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated 2 years ago
- Example Dagster Cloud code for the Hooli Data Engineering organization.☆16Updated 2 weeks ago
- Repo for orienting dbt users to the Dagster asset framework☆55Updated 3 years ago
- Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase☆131Updated 3 years ago
- DuckDB SQL Tools add DuckDB support to VSCode, and provide database schema and SQL query interfaces for the popular SQLTools extension, S…☆17Updated last year
- One framework to develop, deploy and operate data workflows with Python and SQL.☆467Updated this week
- A playground for running duckdb as a stateless query engine over a data lake☆212Updated last year
- Contribute to dlt verified sources 🔥☆100Updated last week
- Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables☆74Updated last year
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.☆57Updated 3 years ago
- A SQL linter and auto-formatter for Humans☆54Updated 3 months ago
- Python wrapper for the Sling CLI tool☆58Updated this week
- Create and manage data pipes with Meerschaum.☆153Updated this week
- 📦 Serverless and local-first Open Data Platform☆300Updated last week
- ☆116Updated 2 years ago
- Jupyter Cell / Line Magics for DuckDB☆54Updated 3 weeks ago
- ☆80Updated 2 years ago