blackrock / xml_to_parquet
Convert one or more XML files into Apache Parquet format. Only requires a XSD and XML file to get started.
☆32Updated 2 years ago
Alternatives and similar repositories for xml_to_parquet:
Users that are interested in xml_to_parquet are comparing it to the libraries listed below
- ☆47Updated 2 weeks ago
- Graph Engine for Exploration and Search☆40Updated last year
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies☆19Updated 2 years ago
- API Framework heavily relying on the power of DuckDB and DuckDB extensions. Ready to build performant and cost-efficient APIs on top of B…☆27Updated this week
- ☆31Updated last year
- A monorepo of many Rill example projects☆35Updated this week
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆26Updated last year
- Ibis analytics, with Ibis (and more!)☆21Updated 6 months ago
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- Data Catalog for Databases and Data Warehouses☆33Updated last year
- Repo demonstrating a Dagster pipeline to generate Neo4j Graph☆21Updated 3 years ago
- Convert a CSV to a parquet file.☆64Updated 2 years ago
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB worker…☆18Updated last year
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily☆95Updated this week
- Abstractions for feature engineering on large graphs of tabular data.☆21Updated last week
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro…☆27Updated 2 years ago
- duckdb_wasm in jupyterlite & pyodide☆23Updated last year
- data wrangling simplicity, complete audit transparency, and at speed☆34Updated 2 weeks ago
- Using the Parquet file format with Python☆15Updated last year
- Demo from NEO4j's Connections: Healthcare & Life Sciences event☆11Updated 4 years ago
- [Project moved] Polars integration for Dagster☆36Updated last year
- A Jupyter kernel for ClickHouse☆24Updated 4 years ago
- A curated list of (open)Cypher resources.☆79Updated 3 years ago
- dagster scikit-learn pipeline example.☆45Updated 2 years ago
- Benchmark study on Kùzu, an embedded OLAP graph database, on an artificial social network dataset☆35Updated 3 months ago
- A serverless duckDB deployment at GCP☆38Updated 2 years ago
- Notebooks for the ML Link Prediction Course☆14Updated 4 years ago
- A python package to create a database on the platform using our moj data warehousing framework☆21Updated 6 months ago
- A maximum-strength name parser for record linkage.☆36Updated last month