Netflix / pygenieLinks
☆75Updated 5 months ago
Alternatives and similar repositories for pygenie
Users that are interested in pygenie are comparing it to the libraries listed below
Sorting:
- A toolkit providing a uniform interface for connecting to and extracting data from a wide variety of (potentially remote) data stores (in…☆254Updated 3 weeks ago
- Python stream processing for humans☆185Updated 6 months ago
- Thin-client metrics library for use with Atlas and SpectatorD☆48Updated last month
- IP Address dtype and block for pandas☆105Updated 2 years ago
- Fork of aio-libs/aiokafka☆27Updated last year
- A Cookiecutter template for creating Faust projects quickly.☆70Updated 2 years ago
- Airflow plugin to transfer arbitrary files between operators☆78Updated 6 years ago
- Apache Avro <-> pandas DataFrame☆138Updated last year
- Airflow configuration for Telemetry☆192Updated this week
- A pandas.DataFrame-based ORM.☆85Updated 3 years ago
- Dataflow programming for python.☆292Updated 2 years ago
- T4 is now in production as Quilt 3☆64Updated 6 years ago
- SQLAlchemy dialect for Turbodbc☆23Updated 4 months ago
- A Getting Started Guide for developing and using Airflow Plugins☆93Updated 6 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆110Updated this week
- Faust dockerized application☆68Updated 2 years ago
- Metadata service library for Amundsen☆83Updated 3 weeks ago
- An example mini data warehouse for python project stats, template for new projects☆179Updated 5 years ago
- 📚 Notebook storage and publishing workflows for the masses☆201Updated 3 years ago
- Amazon S3 filesystem for PyFilesystem2☆156Updated last year
- Docker image with Python 3.6 and 3.7 using Conda, with CUDA variants. To serve as base image for Machine Learning projects.☆84Updated 2 years ago
- Optional extensions for petl based on third party libraries.☆45Updated 10 years ago
- Convert JSON files to Parquet using PyArrow☆97Updated last year
- Airflow workflow management platform chef cookbook.☆71Updated 6 years ago
- A DBAPI and SQLAlchemy dialect for Elasticsearch☆117Updated last year
- Serializes data into a JSON format using AVRO schema.☆137Updated 3 years ago
- Python library for API access and data analysis in Product, BI, Revenue Operations (GAM, GA, Athena etc.)☆73Updated 9 months ago
- Data pipelines from re-usable components☆107Updated 2 years ago
- Deploy dask on YARN clusters☆69Updated last year
- A small Python module containing quick utility functions for standard ETL processes.☆36Updated last week