Wittline / csv-schema-inference
A tool to automatically infer column data types in .csv files
☆36 · Updated 2 years ago
Alternatives and similar repositories for csv-schema-inference
Users who are interested in csv-schema-inference are comparing it to the libraries listed below
- Opinionated JSON to CSV/XLSX/SQLITE/PARQUET converter. Flattens JSON fast. ☆204 · Updated 4 months ago
- Python+VueJS application to load, explore, combine, transform and deliver data ☆99 · Updated 9 months ago
- dagster scikit-learn pipeline example. ☆46 · Updated 2 years ago
- An example of a Dagster project with a possible folder structure to organize the assets, jobs, repositories, schedules, and ops. Also has… ☆101 · Updated last year
- A playground for running duckdb as a stateless query engine over a data lake ☆214 · Updated last year
- Demo Project for Open Source MDS ☆167 · Updated 2 months ago
- Create and manage data pipes with Meerschaum. ☆153 · Updated 3 weeks ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases. ☆57 · Updated 3 years ago
- Possibly the fastest DataFrame-agnostic quality check library in town. ☆225 · Updated 3 weeks ago
- Set up a Cost-Effective Modern Data Stack for a Charity ☆19 · Updated 7 months ago
- ☆116 · Updated 2 years ago
- One framework to develop, deploy and operate data workflows with Python and SQL. ☆471 · Updated last week
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆114 · Updated 2 weeks ago
- Anomstack - Painless open source anomaly detection for your metrics 📈📉🚀 ☆106 · Updated 2 weeks ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily ☆105 · Updated this week
- ☆80 · Updated 2 years ago
- ODD Specification is a universal open standard for collecting metadata. ☆144 · Updated last year
- 📦 Serverless and local-first Open Data Platform ☆302 · Updated last month
- A "modern" Strava data pipeline fueled by dlt, duckdb, dbt, and evidence.dev ☆36 · Updated 6 months ago
- Write python locally, execute SQL in your data warehouse ☆269 · Updated 3 years ago
- quadipy is a python package to help transform structured data into RDF graph format ☆19 · Updated 2 years ago
- A SQLite adapter plugin for dbt (data build tool) ☆83 · Updated 4 months ago
- Jupyter Cell / Line Magics for DuckDB ☆54 · Updated last month
- Playground for using large language models in the Modern Data Stack for entity matching ☆108 · Updated 2 years ago
- 🏃‍♀️ Minimalist SQL orchestrator ☆293 · Updated last week
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro… ☆29 · Updated 3 years ago
- Work with your web service, database, and streaming schemas in a single format. ☆346 · Updated 2 months ago
- Swiple enables you to easily observe, understand, validate and improve the quality of your data ☆84 · Updated this week
- Schema modelling framework for decentralised domain-driven ownership of data. ☆259 · Updated last year
- Type System for Data Analysis in Python ☆214 · Updated 9 months ago