This module uploads files from an FTP server to S3. It opens an SFTP connection to the server and uploads every file in the specified directory to the specified S3 bucket. Key features of this module:
- Creates a secure SSH connection with the FTP server.
- Handles multipart upload…
☆13May 9, 2020Updated 5 years ago
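The flow described above (list a remote directory over SFTP, then stream each file into a bucket) can be sketched roughly as follows. This is an illustrative sketch, not the repository's actual code: the function name `upload_directory` and its parameters are hypothetical. It assumes `sftp` behaves like a `paramiko.SFTPClient` (only `listdir_attr` and `open` are used) and `s3` behaves like a boto3 S3 client (only `upload_fileobj` is used, which switches to multipart upload on its own for large objects).

```python
import posixpath
import stat


def upload_directory(sftp, s3, remote_dir, bucket, prefix=""):
    """Copy every regular file in remote_dir to the bucket; return the S3 keys.

    Hypothetical helper: `sftp` is assumed to offer listdir_attr() and open()
    like paramiko.SFTPClient, and `s3` is assumed to offer upload_fileobj()
    like a boto3 S3 client.
    """
    uploaded = []
    for entry in sftp.listdir_attr(remote_dir):
        # Skip sub-directories, symlinks and entries without mode information.
        if entry.st_mode is None or not stat.S_ISREG(entry.st_mode):
            continue
        remote_path = posixpath.join(remote_dir, entry.filename)
        key = posixpath.join(prefix, entry.filename) if prefix else entry.filename
        # Stream the remote file straight into S3 without a local temp copy.
        with sftp.open(remote_path, "rb") as remote_file:
            s3.upload_fileobj(remote_file, bucket, key)
        uploaded.append(key)
    return uploaded
```

In real use, `sftp` might come from `paramiko.SSHClient().open_sftp()` after connecting, and `s3` from `boto3.client("s3")`; the host, credentials, and bucket name would be supplied by the caller. Passing the clients in as arguments keeps the copy logic easy to exercise with stand-in objects.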
Alternatives and similar repositories for Python-FTP-File-Ingestion
Users that are interested in Python-FTP-File-Ingestion are comparing it to the libraries listed below.
- ☆16Jun 27, 2020Updated 5 years ago
- Using Apache Airflow to author, run and monitor complex data pipelines.☆12Oct 24, 2018Updated 7 years ago
- ☆11Oct 11, 2022Updated 3 years ago
- A Gentle introduction to Machine Learning with Apache Spark☆11Mar 2, 2026Updated last month
- A set of widgets for Python's Orange Machine Learning to work with Apache Spark ML☆15Dec 24, 2016Updated 9 years ago
- Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark☆14Apr 14, 2023Updated 2 years ago
- Python wrapper for libraries.io API☆19Dec 1, 2024Updated last year
- The Demo for Blog: Modularization using Python and Docker (MicroService)☆12Feb 4, 2021Updated 5 years ago
- Helper utils for our packages☆32Dec 1, 2025Updated 4 months ago
- A pyspark lib to validate data quality☆19Nov 11, 2022Updated 3 years ago
- This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark w…☆16Oct 3, 2025Updated 6 months ago
- Due to lack of resources on how to deploy kafka with simple SASL authentication (just username and password) and how to write producer an…☆12Dec 29, 2021Updated 4 years ago
- Rocksdb state storage implementation for Structured Streaming.☆17Oct 21, 2020Updated 5 years ago
- This is a fork of the Apache Flink Kinesis connector adding Enhanced Fanout support for Flink 1.8/1.11 on KDA.☆24Mar 1, 2026Updated last month
- ☆20Feb 14, 2018Updated 8 years ago
- ☆23Oct 3, 2024Updated last year
- Workshop materials for the workshop "Computer Science Crash Course for Python Hackers" at PyBay 2017☆17Aug 10, 2017Updated 8 years ago
- ☆16Apr 9, 2019Updated 7 years ago
- Example of using Faust with Docker☆23Sep 30, 2019Updated 6 years ago
- Demonstrates calling a Scala UDF from Python using spark-submit with an EGG and JAR☆23Mar 3, 2020Updated 6 years ago
- The kubectl plugin which allows us to test IRSA configuration AWS sa☆23Nov 2, 2022Updated 3 years ago
- Blog post on ETL pipelines with Airflow☆23Aug 31, 2025Updated 7 months ago
- Amazon EMR on EKS Custom Image CLI☆32Sep 26, 2024Updated last year
- A toolset to streamline running spark python on EMR☆20Nov 16, 2016Updated 9 years ago
- bootstrap for my dev setup☆60Feb 29, 2020Updated 6 years ago
- Multi-stage, config driven, SQL based ETL framework using PySpark☆26Sep 16, 2019Updated 6 years ago
- Application to securely map users on a multi tenant Amazon EMR cluster to different IAM Roles and then assume the mapped Role.☆24Oct 24, 2023Updated 2 years ago
- Spark pipelines that correspond to a series of Dataflow examples.☆27May 5, 2019Updated 6 years ago
- A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract d…☆24Nov 22, 2021Updated 4 years ago
- ☆26Apr 15, 2021Updated 4 years ago
- Chapter 5 of the AWS Cookbook☆14Mar 13, 2022Updated 4 years ago
- An opinionated Kafka producer/consumer built on top of confluent-kafka-python/librdkafka☆28Apr 30, 2025Updated 11 months ago
- Define the shape of your data with simple python data structures. Use those data descriptions to validate your application.☆43Jul 24, 2016Updated 9 years ago
- Cloud based Data Platform based on Apache Spark☆27Feb 17, 2026Updated last month
- Utility tool to pull Excel data into Word, PowerPoint and Outlook☆14Jan 25, 2020Updated 6 years ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Dec 31, 2024Updated last year
- Data Package reader for Pandas☆19Feb 10, 2023Updated 3 years ago
- This repo contains sample code and sample notebooks to illustrate how to work with Amazon FinSpace☆21Feb 12, 2025Updated last year
- Jupyter Notebook Scientific Python Stack extension for Docker Desktop☆19Mar 26, 2024Updated 2 years ago