blackrock / xml_to_parquet
Convert one or more XML files into Apache Parquet format. Only requires a XSD and XML file to get started.
☆34Updated 2 years ago
Alternatives and similar repositories for xml_to_parquet
Users that are interested in xml_to_parquet are comparing it to the libraries listed below
Sorting:
- A small Python module containing quick utility functions for standard ETL processes.☆35Updated 2 weeks ago
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies☆19Updated 2 years ago
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- Convert a CSV to a parquet file.☆64Updated 2 years ago
- Benchmark study on Kuzu, an embedded graph database, on an artificial social network dataset☆39Updated last month
- Graph Engine for Exploration and Search☆40Updated last year
- Data Catalog for Databases and Data Warehouses☆35Updated last year
- Repo demonstrating a Dagster pipeline to generate Neo4j Graph☆21Updated 4 years ago
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- API Framework heavily relying on the power of DuckDB and DuckDB extensions. Ready to build performant and cost-efficient APIs on top of B…☆28Updated last month
- ERPL is a DuckDB extension to connect to API based ecosystems via standard interfaces like OData, GraphQL and REST. This works e.g. for S…☆11Updated 6 months ago
- Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and unde…☆16Updated 2 years ago
- PyPi module for Graphlet AI Knowledge Graph Factory☆29Updated 2 years ago
- A tool to read CSV files with CSVW metadata and transform them into other formats.☆32Updated 6 years ago
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB worker…☆18Updated last year
- Chatlytics is a data query and visualization platform for chat!☆13Updated 8 years ago
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆26Updated last year
- ☆33Updated last year
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro…☆27Updated 2 years ago
- A python library bakeoff for medium sized datasets☆24Updated last year
- ☆83Updated last year
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆36Updated 4 years ago
- This connector is a dbt project that maps Medicare CCLF claims data to the Tuva Input Layer.☆13Updated last month
- A conda-smithy repository for python-duckdb.☆13Updated last month
- A Jupyter kernel for ClickHouse☆24Updated 4 years ago
- An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks☆21Updated 2 years ago
- @vega transforms with @ibis-project expressions☆29Updated 4 years ago
- A maximum-strength name parser for record linkage.☆37Updated 2 weeks ago
- ☆15Updated 2 years ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily☆97Updated this week