Wittline / csv-schema-inferenceLinks
A tool to automatically infer columns data types in .csv files
☆37Updated 3 years ago
Alternatives and similar repositories for csv-schema-inference
Users that are interested in csv-schema-inference are comparing it to the libraries listed below
Sorting:
- Opinionated JSON to CSV/XLSX/SQLITE/PARQUET converter. Flattens JSON fast.☆205Updated 7 months ago
- Create and manage data pipes with Meerschaum.☆153Updated 3 weeks ago
- ☆116Updated 2 years ago
- dagster scikit-learn pipeline example.☆46Updated 2 years ago
- An example of a Dagster project with a possible folder structure to organize the assets, jobs, repositories, schedules, and ops. Also has…☆101Updated last year
- Python+VueJS application to load, explore, combine,transform and deliver data☆102Updated 11 months ago
- Demo Project for Open Source MDS☆170Updated 5 months ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆51Updated 2 years ago
- Write python locally, execute SQL in your data warehouse☆269Updated 3 years ago
- ☆74Updated last year
- Scripts to make specific datasets cleaner and more convenient☆42Updated 3 years ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆234Updated 3 months ago
- Data Tools Subjective List☆89Updated 2 years ago
- Playground for using large language models into the Modern Data Stack for entity matching☆108Updated 2 years ago
- Anomstack - Painless open source anomaly detection for your metrics 📈📉🚀☆107Updated 2 months ago
- Set up a Cost-Effective Modern Data Stack for a Charity☆19Updated 10 months ago
- A playground for running duckdb as a stateless query engine over a data lake☆218Updated 2 years ago
- A curated list of dagster code snippets for data engineers☆56Updated last year
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.☆57Updated 3 years ago
- 🏃♀️ Minimalist SQL orchestrator☆302Updated this week
- ✨ Build dashboards with end-to-end version control. 🔋 CLI w/ batteries included, no infra required. Develop on your laptop for instant r…☆91Updated last week
- A curated collection of helpful SQL queries and functions, maintained by Count.☆208Updated 4 years ago
- Flatten/Explode JSON objects☆21Updated 8 months ago
- Jupyter Cell / Line Magics for DuckDB☆54Updated 2 weeks ago
- New generation opensource data stack☆76Updated 3 years ago
- A "modern" Strava data pipeline fueled by dlt, duckdb, dbt, and evidence.dev☆38Updated 8 months ago
- ☆81Updated 11 months ago
- 📦 Serverless and local-first Open Data Platform☆304Updated last week
- DuckDB SQL Tools add DuckDB support to VSCode, and provide database schema and SQL query interfaces for the popular SQLTools extension, S…☆19Updated last year
- ☆82Updated 4 months ago