cldellow / csv2parquetLinks
Convert a CSV to a parquet file.
☆64Updated 2 years ago
Alternatives and similar repositories for csv2parquet
Users that are interested in csv2parquet are comparing it to the libraries listed below
Sorting:
- A SQLite vtable extension to read Parquet files☆271Updated 4 years ago
- Embedded MonetDB with a Python frontend and fast Numpy/Pandas support☆64Updated last year
- A Jupyter kernel for ClickHouse☆24Updated 5 years ago
- Data pipelines from re-usable components☆107Updated 2 years ago
- DuckDB extension to read and write to SQLite databases☆258Updated last month
- Data Catalog for Databases and Data Warehouses☆35Updated last year
- Demo converting streamlit uber nyc rides to use duckdb☆29Updated 2 years ago
- A conda-smithy repository for python-duckdb.☆13Updated last month
- A collection of handy CLI tools to convert CSV and JSON to Apache Arrow and Parquet☆190Updated last week
- Extension for DuckDB for functions that require the Apache Arrow dependency☆44Updated 6 months ago
- Dedupe/batch geocode addresses and venues around the world with libpostal☆84Updated 3 years ago
- A Python wrapper over the GraphGen system☆37Updated 8 years ago
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- S3 CSV Foreign Data Wrapper Using Multicorn☆28Updated 3 years ago
- Data loader for the Apache Arrow format.☆62Updated last week
- Codd method-chained SQL generator and Pandas data processing in Python.☆118Updated 2 years ago
- Framework for processing data packages in pipelines of modular components.☆121Updated 4 months ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily☆104Updated this week
- ☆149Updated 6 months ago
- GraphQL service for python dataframes and parquet datasets.☆89Updated this week
- Convert JSON files to Apache Parquet.☆48Updated 2 years ago
- ☆80Updated 2 years ago
- An experimental Athena extension for DuckDB 🐤☆57Updated 10 months ago
- ☆90Updated last year
- Convert CSV files to Apache Parquet.☆79Updated 2 years ago
- SQL transformation tool for DuckDB written in Rust☆72Updated 7 months ago
- Resources for tackling record linkage / deduplication / data matching problems☆125Updated last year
- Arc is an opinionated framework for defining data pipelines which are predictable, repeatable and manageable.☆169Updated last year
- a toy duckdb based timeseries database☆15Updated 5 years ago
- A DuckDB extension to read data directly from databases supporting the ODBC interface☆86Updated 2 years ago