cldellow / csv2parquetLinks
Convert a CSV to a parquet file.
☆64Updated 2 years ago
Alternatives and similar repositories for csv2parquet
Users that are interested in csv2parquet are comparing it to the libraries listed below
Sorting:
- A SQLite vtable extension to read Parquet files☆271Updated 4 years ago
- Embedded MonetDB with a Python frontend and fast Numpy/Pandas support☆63Updated 10 months ago
- Data pipelines from re-usable components☆107Updated 2 years ago
- A conda-smithy repository for python-duckdb.☆13Updated last month
- Codd method-chained SQL generator and Pandas data processing in Python.☆117Updated last year
- Demo converting streamlit uber nyc rides to use duckdb☆29Updated 2 years ago
- A Jupyter kernel for ClickHouse☆24Updated 5 years ago
- DuckDB extension to read and write to SQLite databases☆253Updated last month
- Data Catalog for Databases and Data Warehouses☆35Updated last year
- Dedupe/batch geocode addresses and venues around the world with libpostal☆83Updated 3 years ago
- GraphQL service for arrow tables and parquet data sets.☆89Updated this week
- ☆147Updated 4 months ago
- ☆90Updated last year
- Data loader for the Apache Arrow format.☆61Updated 3 weeks ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily☆101Updated this week
- A collection of handy CLI tools to convert CSV and JSON to Apache Arrow and Parquet☆183Updated 3 weeks ago
- Convert CSV files to Apache Parquet.☆79Updated 2 years ago
- S3 CSV Foreign Data Wrapper Using Multicorn☆28Updated 3 years ago
- Framework for processing data packages in pipelines of modular components.☆121Updated 2 months ago
- A DuckDB extension to read data directly from databases supporting the ODBC interface☆84Updated last year
- Arc is an opinionated framework for defining data pipelines which are predictable, repeatable and manageable.☆169Updated last year
- Apache Arrow Flight SQL adapter for PostgreSQL☆95Updated last week
- Open source Flotilla☆195Updated last week
- ☆27Updated 2 weeks ago
- Resources for tackling record linkage / deduplication / data matching problems☆125Updated last year
- Apache Arrow PostgreSQL connector☆61Updated last year
- ☆116Updated 2 years ago
- An experimental Athena extension for DuckDB 🐤☆54Updated 7 months ago
- Convert JSON files to Apache Parquet.☆47Updated 2 years ago
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)☆141Updated 2 years ago