cldellow / csv2parquet
Convert a CSV to a parquet file.
☆64Updated 2 years ago
Alternatives and similar repositories for csv2parquet
Users that are interested in csv2parquet are comparing it to the libraries listed below
Sorting:
- A SQLite vtable extension to read Parquet files☆271Updated 4 years ago
- Dump metadata about a Parquet file.☆11Updated 3 years ago
- Embedded MonetDB with a Python frontend and fast Numpy/Pandas support☆62Updated 7 months ago
- A Jupyter kernel for ClickHouse☆24Updated 4 years ago
- Data loader for the Apache Arrow format.☆60Updated 2 weeks ago
- Demo converting streamlit uber nyc rides to use duckdb☆29Updated 2 years ago
- Materials for Apache Arrow workshop at VLDB 2019☆42Updated 4 years ago
- Extension for DuckDB for functions that require the Apache Arrow dependency☆41Updated this week
- S3 CSV Foreign Data Wrapper Using Multicorn☆28Updated 3 years ago
- An experimental Athena extension for DuckDB 🐤☆54Updated 4 months ago
- A conda-smithy repository for python-duckdb.☆13Updated last month
- Sample code to accompany blog post showcasing Arrow Flight SQL running on DuckDB☆33Updated 2 years ago
- Inspect Your Servers with DuckDB☆30Updated last week
- A purely experimental DuckDB Deltalake extension☆95Updated this week
- Compilation and rule-based optimization framework for relational algebra. Raco is the language, optimization, and query translation layer…☆72Updated 7 years ago
- A tool to read CSV files with CSVW metadata and transform them into other formats.☆32Updated 6 years ago
- Data pipelines from re-usable components☆108Updated 2 years ago
- Codd method-chained SQL generator and Pandas data processing in Python.☆117Updated last year
- prerelease built versions of arrow/master for graphistry☆34Updated 6 years ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily☆97Updated this week
- Apache Arrow PostgreSQL connector☆59Updated last year
- Scripts and code written whilst learning and experimenting with machine learning☆13Updated 2 years ago
- DuckDB Extension Linearization/Delinearization, Z-Order, Hilbert and Morton Curves☆43Updated last month
- A python library bakeoff for medium sized datasets☆24Updated last year
- ☆141Updated last month
- Data Catalog for Databases and Data Warehouses☆35Updated last year
- ☆79Updated 2 years ago
- Python Driver for Apache Drill.☆59Updated 2 years ago
- Utilities for creating ETL pipelines with mara☆36Updated 2 years ago
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago