cognitedata / python-extractor-utils
Framework for developing extractors in Python
☆13 Updated last week
Alternatives and similar repositories for python-extractor-utils:
Users who are interested in python-extractor-utils are comparing it to the libraries listed below.
- Utilities for creating ETL pipelines with mara ☆36 Updated 2 years ago
- A serverless DuckDB deployment at GCP ☆38 Updated 2 years ago
- Data Catalog for Databases and Data Warehouses ☆32 Updated last year
- Terraform module for a PostgreSQL-backed Apache Airflow instance ☆24 Updated 6 years ago
- DuckDB with Dashboarding tools demo evidence, streamlit and rill ☆15 Updated last year
- DataHub on AWS demonstration resources ☆10 Updated 2 years ago
- A python package to create a database on the platform using our moj data warehousing framework ☆21 Updated 4 months ago
- ERPL is a DuckDB extension to integrate Enterprise Data in your Data Science and ML pipelines within minutes! ERPL connects DuckDB to SAP… ☆32 Updated 7 months ago
- RESTful API for prometheus (stock portfolio allocation & analysis) ☆8 Updated 5 years ago
- Drag N Drop WebApp to Build and Manage Airflow DAGs ☆25 Updated 2 years ago
- An experimental Athena extension for DuckDB 🐤 ☆53 Updated last month
- A framework for distributing flask apps across separate packages with minimal dependencies ☆15 Updated last year
- A python library bakeoff for medium sized datasets ☆24 Updated last year
- Ansible Superset Role ☆16 Updated 4 years ago
- Lightweight configuration and access to multiple databases in a single project ☆38 Updated last year
- ☆12 Updated 10 years ago
- Setup Apache Airflow on Kubernetes ☆9 Updated 6 years ago
- Example of using Airflow to schedule downloading data from S3 and launching spark jobs ☆15 Updated 8 years ago
- This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A python job will then be submitted to a Apach… ☆19 Updated 8 years ago
- Materials for Apache Arrow workshop at VLDB 2019 ☆42 Updated 4 years ago
- Launch JupyterHub on AWS using Terraform ☆12 Updated 6 years ago
- ☕⛵ WIP PySpark dependency management ☆22 Updated 6 years ago
- Build Lambda deployment packages faster with Docker ☆23 Updated last year
- A Python object protocol for projects to interchange data frame-like data without forcing pandas.DataFrame as the intermediary ☆16 Updated 4 years ago
- A monorepo of many Rill example projects ☆33 Updated 2 weeks ago
- Easy access to APIs from SAP Digital Supply Chain for data scientists. ☆24 Updated 4 months ago
- ☆15 Updated last year
- The sane way of building a data layer in Airflow ☆24 Updated 5 years ago
- Material from presentations ☆13 Updated 3 years ago
- Extract, PreProcess, and Analyze big data on GPUs ☆21 Updated 6 years ago