cldellow / csv2parquetLinks
Convert a CSV to a parquet file.
☆64Updated 2 years ago
Alternatives and similar repositories for csv2parquet
Users that are interested in csv2parquet are comparing it to the libraries listed below
Sorting:
- Embedded MonetDB with a Python frontend and fast Numpy/Pandas support☆62Updated 10 months ago
- A SQLite vtable extension to read Parquet files☆271Updated 4 years ago
- A Jupyter kernel for ClickHouse☆24Updated 5 years ago
- Demo converting streamlit uber nyc rides to use duckdb☆29Updated 2 years ago
- DuckDB extension to read and write to SQLite databases☆252Updated 3 weeks ago
- Data pipelines from re-usable components☆107Updated 2 years ago
- Data loader for the Apache Arrow format.☆61Updated last month
- S3 CSV Foreign Data Wrapper Using Multicorn☆28Updated 3 years ago
- Dedupe/batch geocode addresses and venues around the world with libpostal☆83Updated 3 years ago
- Codd method-chained SQL generator and Pandas data processing in Python.☆117Updated last year
- a toy duckdb based timeseries database☆15Updated 4 years ago
- GraphQL service for arrow tables and parquet data sets.☆89Updated this week
- A conda-smithy repository for python-duckdb.☆13Updated 3 weeks ago
- Data Catalog for Databases and Data Warehouses☆35Updated last year
- A collection of handy CLI tools to convert CSV and JSON to Apache Arrow and Parquet☆183Updated last month
- Convert JSON files to Apache Parquet.☆47Updated 2 years ago
- ☆79Updated 2 years ago
- Framework for processing data packages in pipelines of modular components.☆121Updated last month
- ☆90Updated last year
- prerelease built versions of arrow/master for graphistry☆34Updated 6 years ago
- ☆146Updated 3 months ago
- Convert CSV files to Apache Parquet.☆78Updated 2 years ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily☆101Updated this week
- A DuckDB extension to read data directly from databases supporting the ODBC interface☆84Updated last year
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- Extension for DuckDB for functions that require the Apache Arrow dependency☆43Updated 2 months ago
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)☆141Updated 2 years ago
- Arc is an opinionated framework for defining data pipelines which are predictable, repeatable and manageable.☆169Updated last year
- Apache Arrow PostgreSQL connector☆61Updated last year
- A tool to read CSV files with CSVW metadata and transform them into other formats.☆33Updated 6 years ago