datirium / cwl-airflow-parser
Package to extend Airflow functionality with CWL v1.0 support
☆12 Updated 5 years ago
Related projects
Alternatives and complementary repositories for cwl-airflow-parser
- Dremio Flight connector. Access Dremio using Arrow flight ☆40 Updated 3 years ago
- Reproducing Distributed Systems and Experiments on Cloud ☆39 Updated last year
- Documentation and resources for deploying JupyterHub on Hadoop ☆18 Updated 5 years ago
- The sane way of building a data layer in Airflow ☆24 Updated 4 years ago
- Ansible roles to deploy Kubernetes, JupyterHub, Jupyter Enterprise Gateway and Spark on Kubernetes cluster ☆38 Updated 3 years ago
- An Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes ☆53 Updated 4 years ago
- Astronomer Vendor Images ☆12 Updated this week
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀 ☆27 Updated 2 years ago
- a toy duckdb based timeseries database ☆14 Updated 4 years ago
- Data Catalog for Databases and Data Warehouses ☆31 Updated 10 months ago
- A DockerSwarm Jupyterhub setup, which uses a NFS Server running in a Docker Container for persistent storage ☆20 Updated 6 years ago
- Parquet file management in S3 for Athena / Spectrum / Presto partitioning ☆22 Updated this week
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o… ☆48 Updated 10 months ago
- Export Airflow metrics (from mysql) in prometheus format ☆29 Updated 2 years ago
- A K8s-based infrastructure for analytics ☆24 Updated 4 years ago
- A Spark datasource for the HadoopOffice library ☆39 Updated 2 years ago
- Fybrik platform - Arrow/Flight module ☆16 Updated 3 months ago
- Parquet Command-line Tools ☆18 Updated 8 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor… ☆41 Updated last year
- A Jupyter kernel for ClickHouse ☆24 Updated 4 years ago
- A bridge to Apache Atlas for provenance metadata created in course of using Apache NiFi ☆15 Updated last year
- ☕⛵WIP PySpark dependency management ☆22 Updated 6 years ago
- Jupyter notebook extension for proxying a VNC session ☆26 Updated 4 years ago
- Dockerflow is a workflow runner that uses Dataflow to run a series of tasks in Docker with the Pipelines API ☆97 Updated 7 years ago
- A conda-smithy repository for python-duckdb. ☆13 Updated 2 weeks ago
- Alluxio Python client - Access Any Data Source with Python ☆26 Updated 3 weeks ago
- A facebook for data ☆26 Updated 5 years ago
- Code for our local jupyterhub install ☆18 Updated 6 years ago
- JupyterHub proxy implementation with traefik ☆54 Updated 2 weeks ago