egen / spark-sftpView external linksLinks
Spark connector for SFTP
☆98Mar 26, 2023Updated 2 years ago
Alternatives and similar repositories for spark-sftp
Users that are interested in spark-sftp are comparing it to the libraries listed below
Sorting:
- API for reading and writing data via various file transfer protocols from Apache Spark.☆21Sep 23, 2020Updated 5 years ago
- Integration of Iceberg table management into Spark SQL☆11Jan 21, 2020Updated 6 years ago
- ☆11Dec 10, 2015Updated 10 years ago
- Support for operating on images via Apache Spark☆26Jun 12, 2023Updated 2 years ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package☆10Sep 9, 2015Updated 10 years ago
- Hortonworks Data Platform Data Generation Tool☆13Nov 30, 2017Updated 8 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra☆63Dec 5, 2019Updated 6 years ago
- Alerting and monitoring tool for Apache Spark☆23May 20, 2022Updated 3 years ago
- Avro SerDe for Apache Spark structured APIs.☆242Jun 10, 2025Updated 8 months ago
- Argument parsing in Scala☆84Mar 27, 2023Updated 2 years ago
- Code and setup information for Introduction to Machine Learning with Spark☆12Sep 4, 2015Updated 10 years ago
- Real-time Monitoring☆29May 14, 2012Updated 13 years ago
- Building custom data sources for Apache Spark, in Java.☆12Oct 12, 2020Updated 5 years ago
- Workshop for Hadoop Operations Best Practices☆10Feb 24, 2015Updated 10 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆281Aug 3, 2018Updated 7 years ago
- Amazon Kinesis Source for Structured Streaming☆12Nov 6, 2017Updated 8 years ago
- Spark UDFs to deserialize Avro messages with schemas stored in Schema Registry.☆20Jan 11, 2018Updated 8 years ago
- This project is for examples of how to use Zeppelin. https://github.com/apache/incubator-zeppelin☆25Jan 27, 2016Updated 10 years ago
- High performance HBase / Spark SQL engine☆28Jul 7, 2022Updated 3 years ago
- TPC-DS benchmark kit with some modifications/additions☆10Nov 12, 2015Updated 10 years ago
- A library for financial and time series calculations on Apache Spark☆28Feb 2, 2016Updated 10 years ago
- ☆16Sep 17, 2017Updated 8 years ago
- Set of ETL utils for Spark☆15May 4, 2020Updated 5 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Sep 8, 2022Updated 3 years ago
- Recipes and examples for Apache Spark☆13Jan 21, 2015Updated 11 years ago
- Spark data source for Cognite Data Fusion☆23Updated this week
- Presto SQL query formatter☆15Jan 1, 2024Updated 2 years ago
- High Performance Kafka Connector for Spark Streaming.Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper.…☆636Feb 26, 2022Updated 3 years ago
- Scalable CDC Pattern Implemented using PySpark☆18Oct 8, 2025Updated 4 months ago
- Db2 JDBC connector for Trino☆19Jan 6, 2023Updated 3 years ago
- ☆16Jun 9, 2016Updated 9 years ago
- Dockerfile for Apache Zeppelin☆17Dec 9, 2015Updated 10 years ago
- NetFlow data source for Spark SQL and DataFrames☆18May 6, 2021Updated 4 years ago
- Mesos Integration Tests on Docker/Ec2☆15May 25, 2023Updated 2 years ago
- Import and export TensorFlow records from/to Spark☆18Jul 7, 2017Updated 8 years ago
- Spark Example using Phoenix to interact with HBase☆16Nov 2, 2016Updated 9 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆40Jun 29, 2017Updated 8 years ago
- A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).☆152Apr 21, 2023Updated 2 years ago
- Complete Pipeline Training at Big Data Scala By the Bay☆71Oct 27, 2015Updated 10 years ago