Netflix / pygenieLinks
☆75Updated 3 months ago
Alternatives and similar repositories for pygenie
Users that are interested in pygenie are comparing it to the libraries listed below
Sorting:
- A toolkit providing a uniform interface for connecting to and extracting data from a wide variety of (potentially remote) data stores (in…☆255Updated last year
- Thin-client metrics library for use with Atlas and SpectatorD☆48Updated 2 weeks ago
- transformpy is a Python 2/3 module for doing transforms on "streams" of data☆29Updated 7 years ago
- ETLy is an add-on dashboard service on top of Apache Airflow.☆69Updated last year
- Snowplow event tracker for Python. Add analytics to your Python and Django apps, webapps and games☆44Updated 2 weeks ago
- Set of iPython and Jupyter extensions to improve user experience☆50Updated 5 years ago
- Dockerized setup for testing code on realistic hadoop clusters☆27Updated 4 years ago
- Deploy dask on YARN clusters☆69Updated 9 months ago
- REST-like API exposing Airflow data and operations☆61Updated 6 years ago
- Apache Avro <-> pandas DataFrame☆137Updated 10 months ago
- Utilities for creating ETL pipelines with mara☆36Updated 3 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆109Updated last week
- Fork of aio-libs/aiokafka☆27Updated last year
- IP Address dtype and block for pandas☆105Updated last year
- ☆15Updated 6 years ago
- Serializes data into a JSON format using AVRO schema.☆137Updated 3 years ago
- Lightweight configuration and access to multiple databases in a single project☆38Updated last year
- Optional extensions for petl based on third party libraries.☆45Updated 9 years ago
- A Cookiecutter template for creating Faust projects quickly.☆70Updated 2 years ago
- As a believer of learning through examples, I have decided to put my own examples of Gremlin queries inside Jupyter Notebooks for people …☆32Updated 5 years ago
- Airflow plugin to transfer arbitrary files between operators☆78Updated 6 years ago
- SQLAlchemy dialect for Turbodbc☆23Updated last month
- Data pipelines from re-usable components☆108Updated 2 years ago
- A Getting Started Guide for developing and using Airflow Plugins☆93Updated 6 years ago
- Pylint plugin for static code analysis on Airflow code☆95Updated 4 years ago
- ☆54Updated 6 years ago
- A Python framework for data processing on GCP.☆119Updated last month
- ☆10Updated 6 years ago
- Asynchronous actions for PySpark☆47Updated 3 years ago
- Apache (Py)Spark type annotations (stub files).☆117Updated 2 years ago