This module uploads files from an FTP server to S3. It opens an SFTP connection to the FTP server and uploads every file in the specified directory to the specified S3 bucket. Key features of this module: Creates a secure SSH connection with the FTP server. Handles multipart upload…
☆13 · May 9, 2020 · Updated 5 years ago
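The flow described above (SFTP connection, directory listing, upload to a bucket with multipart handling) might be sketched roughly as follows. This is not the repository's actual code: it assumes the third-party packages `paramiko` and `boto3`, password authentication, and AWS credentials already configured in the environment. The names `build_s3_key` and `upload_directory`, and all of their parameters, are hypothetical and chosen only for illustration.

```python
import posixpath
import stat


def build_s3_key(prefix, filename):
    """Join an optional key prefix and a file name into an S3 object key."""
    prefix = prefix.strip("/")
    return posixpath.join(prefix, filename) if prefix else filename


def upload_directory(host, username, password, remote_dir, bucket,
                     prefix="", port=22, chunk_size=8 * 1024 * 1024):
    """Copy every regular file in remote_dir on the SFTP server to the bucket."""
    # Third-party imports stay local so build_s3_key works without them.
    import boto3
    import paramiko
    from boto3.s3.transfer import TransferConfig

    # upload_fileobj switches to a multipart upload once a file
    # exceeds multipart_threshold, sending it in chunk_size parts.
    config = TransferConfig(multipart_threshold=chunk_size,
                            multipart_chunksize=chunk_size)
    s3 = boto3.client("s3")

    ssh = paramiko.SSHClient()
    # Trust only hosts listed in known_hosts; paramiko rejects others by default.
    ssh.load_system_host_keys()
    ssh.connect(host, port=port, username=username, password=password)
    uploaded = []
    try:
        sftp = ssh.open_sftp()
        for entry in sftp.listdir_attr(remote_dir):
            if entry.st_mode is None or not stat.S_ISREG(entry.st_mode):
                continue  # skip subdirectories, links, and other non-files
            key = build_s3_key(prefix, entry.filename)
            remote_path = posixpath.join(remote_dir, entry.filename)
            # Stream the remote file to S3 without writing it to local disk.
            with sftp.open(remote_path, "rb") as remote_file:
                s3.upload_fileobj(remote_file, bucket, key, Config=config)
            uploaded.append(key)
    finally:
        ssh.close()
    return uploaded
```

A call such as `upload_directory("ftp.example.com", "user", "secret", "/outgoing", "my-bucket", prefix="landing")` would return the list of object keys it wrote; the host, credentials, directory, and bucket names here are placeholders. Streaming through `upload_fileobj` is one design choice among several; downloading each file to a temporary path first would also work, at the cost of local disk space.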
Alternatives and similar repositories for Python-FTP-File-Ingestion
Users that are interested in Python-FTP-File-Ingestion are comparing it to the libraries listed below.
- ☆16 · Jun 27, 2020 · Updated 5 years ago
- ☆11 · Oct 11, 2022 · Updated 3 years ago
- A Gentle introduction to Machine Learning with Apache Spark ☆11 · Mar 2, 2026 · Updated 3 weeks ago
- A set of widgets for Python's Orange Machine Learning to work with Apache Spark ML ☆15 · Dec 24, 2016 · Updated 9 years ago
- ☆12 · Oct 16, 2023 · Updated 2 years ago
- Basic Spark utilities ☆13 · Feb 20, 2025 · Updated last year
- A sample restify api service with JWT authentication ☆17 · Dec 11, 2024 · Updated last year
- A pyspark lib to validate data quality ☆18 · Nov 11, 2022 · Updated 3 years ago
- This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark w… ☆16 · Oct 3, 2025 · Updated 5 months ago
- API REST boilerplate using Spring Boot and Redis as database ☆13 · Dec 26, 2018 · Updated 7 years ago
- Due to lack of resources on how to deploy kafka with simple SASL authentication (just username and password) and how to write producer an… ☆12 · Dec 29, 2021 · Updated 4 years ago
- Example to create lineage in Atlas with sqoop and spark ☆14 · Apr 5, 2017 · Updated 8 years ago
- Tools for faster and optimized interaction with Teradata and large datasets. ☆17 · Jul 11, 2018 · Updated 7 years ago
- Rocksdb state storage implementation for Structured Streaming. ☆17 · Oct 21, 2020 · Updated 5 years ago
- Thoughts on things I find interesting. ☆17 · Dec 19, 2024 · Updated last year
- ☆20 · Feb 14, 2018 · Updated 8 years ago
- Spark Structured Streaming JDBC Sink ☆16 · Apr 26, 2021 · Updated 4 years ago
- ☆23 · Oct 3, 2024 · Updated last year
- SQL problems with solution & summary ☆12 · May 2, 2021 · Updated 4 years ago
- ☆16 · Apr 9, 2019 · Updated 6 years ago
- Example of using Faust with Docker ☆23 · Sep 30, 2019 · Updated 6 years ago
- Demonstrates calling a Scala UDF from Python using spark-submit with an EGG and JAR ☆23 · Mar 3, 2020 · Updated 6 years ago
- Amazon EMR on EKS Custom Image CLI ☆32 · Sep 26, 2024 · Updated last year
- A toolset to streamline running spark python on EMR ☆20 · Nov 16, 2016 · Updated 9 years ago
- bootstrap for my dev setup ☆60 · Feb 29, 2020 · Updated 6 years ago
- Django generic marketplace API ☆11 · Jun 10, 2020 · Updated 5 years ago
- Multi-stage, config driven, SQL based ETL framework using PySpark ☆26 · Sep 16, 2019 · Updated 6 years ago
- Application to securely map users on a multi tenant Amazon EMR cluster to different IAM Roles and then assume the mapped Role. ☆24 · Oct 24, 2023 · Updated 2 years ago
- Spark pipelines that correspond to a series of Dataflow examples. ☆27 · May 5, 2019 · Updated 6 years ago
- ☆26 · Apr 15, 2021 · Updated 4 years ago
- Chapter 5 of the AWS Cookbook ☆14 · Mar 13, 2022 · Updated 4 years ago
- An opinionated Kafka producer/consumer built on top of confluent-kafka-python/librdkafka ☆27 · Apr 30, 2025 · Updated 10 months ago
- Tax-Brain is an integrator model for PSL tax models ☆13 · Dec 24, 2025 · Updated 2 months ago
- Cloud based Data Platform based on Apache Spark ☆27 · Feb 17, 2026 · Updated last month
- Spark and Hive docker containers sharing a common MySQL metastore ☆26 · Apr 17, 2020 · Updated 5 years ago
- Utility tool to pull Excel data into Word, PowerPoint and Outlook ☆14 · Jan 25, 2020 · Updated 6 years ago
- Test examples of kafka-clients: unit, integration, end-to-end ☆27 · Nov 16, 2022 · Updated 3 years ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizer ☆25 · Dec 31, 2024 · Updated last year
- PostgreSQL and GreenPlum Data Source for Apache Spark ☆35 · Jul 9, 2025 · Updated 8 months ago