Netflix / pygenie
☆72Updated last year
Related projects ⓘ
Alternatives and complementary repositories for pygenie
- A toolkit providing a uniform interface for connecting to and extracting data from a wide variety of (potentially remote) data stores (in…☆255Updated 5 months ago
- Fork of aio-libs/aiokafka☆26Updated 11 months ago
- ☆47Updated 2 months ago
- IP Address dtype and block for pandas☆104Updated last year
- Vertica dialect for SQLAlchemy using the vertica-python client☆18Updated 4 years ago
- Deploy dask on YARN clusters☆69Updated 3 months ago
- Airflow plugin to transfer arbitrary files between operators☆78Updated 6 years ago
- Convert JSON files to Parquet using PyArrow☆94Updated 10 months ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆104Updated this week
- Data ingestion library for Amundsen to build graph and search index☆206Updated 8 months ago
- A pandas.DataFrame-based ORM.☆84Updated 2 years ago
- A wrapper for libhdfs3 to interact with HDFS from Python☆136Updated 3 years ago
- ☆15Updated 5 years ago
- transformpy is a Python 2/3 module for doing transforms on "streams" of data☆29Updated 7 years ago
- ☆54Updated 6 years ago
- Amazon S3 filesystem for PyFilesystem2☆154Updated 4 months ago
- Metadata service library for Amundsen☆83Updated last year
- Fast iterative local development and testing of Apache Airflow workflows☆193Updated 5 months ago
- A python client library for the Stitch Import API☆42Updated 10 months ago
- SQLAlchemy dialect for Turbodbc☆23Updated 5 months ago
- Start a cluster in EC2 for dask.distributed☆106Updated 4 years ago
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.☆78Updated last week
- Concurrent appendable key-value storage☆106Updated 4 months ago
- Utilities for creating ETL pipelines with mara☆36Updated 2 years ago
- Pandas-SQLAlchemy integration☆28Updated 8 months ago
- Amazon Redshift SQLAlchemy Dialect☆219Updated 4 months ago
- Snowplow event tracker for Python. Add analytics to your Python and Django apps, webapps and games☆43Updated this week
- Set of iPython and Jupyter extensions to improve user experience☆50Updated 5 years ago
- Serializes data into a JSON format using AVRO schema.☆137Updated 2 years ago
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆262Updated last year