blackrock / xml_to_parquet
Convert one or more XML files into Apache Parquet format. Requires only an XSD and an XML file to get started.
☆32 · Updated last year
Related projects:
- Notebooks, slides, and examples for "Streaming, cross-sectional data visualization in Jupyterlab with Perspective and Apache Arrow", my J… ☆26 · Updated 3 years ago
- An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks ☆21 · Updated 2 years ago
- A small Python module containing quick utility functions for standard ETL processes. ☆33 · Updated this week
- Benchmark study on KùzuDB, an embedded OLAP graph database, on an artificial social network dataset ☆24 · Updated last month
- List of entity resolution software and resources. ☆31 · Updated 6 months ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro… ☆26 · Updated 2 years ago
- Graph Engine for Exploration and Search ☆39 · Updated 7 months ago
- A browser-based Parquet file viewer ☆38 · Updated 3 weeks ago
- An experimental Athena extension for DuckDB 🐤 ☆49 · Updated 7 months ago
- Demo converting streamlit uber nyc rides to use duckdb ☆29 · Updated last year
- Data Catalog for Databases and Data Warehouses ☆31 · Updated 8 months ago
- Repo demonstrating a Dagster pipeline to generate Neo4j Graph ☆21 · Updated 3 years ago
- A serverless duckDB deployment at GCP ☆34 · Updated 2 years ago
- A JupyterLab extension providing a SQL formatter, auto-completion, and syntax highlighting for Spark SQL and Trino ☆83 · Updated 2 weeks ago
- Metabase DuckDB Driver shipped as 3rd party plugin ☆72 · Updated 5 months ago
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB worker… ☆18 · Updated 9 months ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily ☆75 · Updated this week
- dagster scikit-learn pipeline example. ☆43 · Updated last year
- Embedded MonetDB with a Python frontend and fast Numpy/Pandas support ☆61 · Updated last month
- Ibis analytics, with Ibis (and more!) ☆19 · Updated this week
- The sane way of building a data layer in Airflow ☆24 · Updated 4 years ago
- Write your dbt models using Ibis ☆47 · Updated 4 months ago
- quadipy is a python package to help transform structured data into RDF graph format ☆18 · Updated last year
- Derivatives models written with the Tributary data flow library ☆19 · Updated 7 months ago
- DuckDB for streaming data ☆62 · Updated 5 months ago
- KnowledgeRepo + JupyterLab ☆48 · Updated 3 months ago
- DuckDB SQL Tools add DuckDB support to VSCode, and provide database schema and SQL query interfaces for the popular SQLTools extension, S… ☆12 · Updated 2 months ago