This module provides the functionality of uploading files to s3 from a FTP server. An SFTP connection is created with the FTP server and all the files present in the specified directory are uploaded to the specified s3 bucket. Following are the key features of this module: Creates a secure ssh connection with FTP server. Handles multipart upload…
☆13May 9, 2020Updated 5 years ago
Alternatives and similar repositories for Python-FTP-File-Ingestion
Users that are interested in Python-FTP-File-Ingestion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Jun 27, 2020Updated 5 years ago
- Using Apache Airflow to author, run and monitor complex data pipelines.☆12Oct 24, 2018Updated 7 years ago
- ☆15Sep 20, 2019Updated 6 years ago
- light wrapper for indeed.com api☆16Oct 30, 2016Updated 9 years ago
- ☆11Oct 11, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A Gentle introduction to Machine Learning with Apache Spark☆11Mar 2, 2026Updated 2 months ago
- Demo code for testing with Pytest and Hypothesis☆14Oct 12, 2021Updated 4 years ago
- ☆15May 31, 2017Updated 8 years ago
- ☆12Oct 16, 2023Updated 2 years ago
- Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark☆14Apr 14, 2023Updated 3 years ago
- Python wrapper for libraries.io API☆19Dec 1, 2024Updated last year
- The Demo for Blog: Modularization using Python and Docker (MicroService)☆12Feb 4, 2021Updated 5 years ago
- Helper utils for our packages☆32Dec 1, 2025Updated 5 months ago
- A pyspark lib to validate data quality☆19Nov 11, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark w…☆16Oct 3, 2025Updated 7 months ago
- Data and Notebook for medium blog post☆20Aug 31, 2019Updated 6 years ago
- Easy access of environment variables from Python with support for typing (ex. booleans, strings, lists, tuples, integers, floats, and dic…☆26Jun 28, 2023Updated 2 years ago
- Due to lack of resources on how to deploy kafka with simple SASL authentication (just username and password) and how to write producer an…☆12Dec 29, 2021Updated 4 years ago
- Example to create lineage in Atlas with sqoop and spark☆14Apr 5, 2017Updated 9 years ago
- Repo for holding the dbt project used to make sense of cloud cost data from the major cloud platforms☆38Mar 1, 2020Updated 6 years ago
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Jul 11, 2018Updated 7 years ago
- Thoughts on things I find interesting.☆17Dec 19, 2024Updated last year
- This is a fork of the Apache Flink Kinesis connector adding Enhanced Fanout support for Flink 1.8/1.11 on KDA.☆24Mar 1, 2026Updated 2 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆20Feb 14, 2018Updated 8 years ago
- JSON of US states and their corresponding cities☆18Sep 14, 2017Updated 8 years ago
- Example of using Faust with Docker☆23Sep 30, 2019Updated 6 years ago
- Demonstrates calling a Scala UDF from Python using spark-submit with an EGG and JAR☆23Mar 3, 2020Updated 6 years ago
- The kubectl plugin which allows us to test IRSA configuration AWS sa☆23Nov 2, 2022Updated 3 years ago
- Simple PHP class that provides tools to define is a point (latitude/longitude) is inside the polygon☆19Nov 8, 2021Updated 4 years ago
- Blog post on ETL pipelines with Airflow☆23Aug 31, 2025Updated 8 months ago
- A cheatsheet comparing syntax b/w R and Python☆21May 2, 2019Updated 7 years ago
- Amazon EMR on EKS Custom Image CLI☆32Sep 26, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Tools for working with icd codes and comorbidities☆26Apr 28, 2021Updated 5 years ago
- A toolset to streamline running spark python on EMR☆20Nov 16, 2016Updated 9 years ago
- Django generic marketplace API☆11Jun 10, 2020Updated 5 years ago
- Multi-stage, config driven, SQL based ETL framework using PySpark☆26Sep 16, 2019Updated 6 years ago
- Kubernetes tutorial with examples and a simple exercise.☆28Dec 10, 2022Updated 3 years ago
- Spark pipelines that correspond to a series of Dataflow examples.☆27May 5, 2019Updated 7 years ago
- ☆26Apr 15, 2021Updated 5 years ago