Spark connector for SFTP
☆98 · Mar 26, 2023 · Updated 3 years ago
Alternatives and similar repositories for spark-sftp
Users who are interested in spark-sftp are comparing it to the libraries listed below.
- Spark data source for Salesforce ☆80 · May 23, 2024 · Updated last year
- spark structured streaming via HTTP communication ☆18 · Jul 7, 2022 · Updated 3 years ago
- Amazon Kinesis Source for Structured Streaming ☆12 · Nov 6, 2017 · Updated 8 years ago
- Terraform module for Mesos + Ceph cluster in AWS VPC and Packer template for the AMI. ☆26 · Dec 25, 2014 · Updated 11 years ago
- ☆17 · Oct 27, 2015 · Updated 10 years ago
- Dockerfile for Apache Zeppelin ☆17 · Dec 9, 2015 · Updated 10 years ago
- Avro SerDe for Apache Spark structured APIs. ☆242 · Jun 10, 2025 · Updated 9 months ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit… ☆281 · Aug 3, 2018 · Updated 7 years ago
- Support for operating on images via Apache Spark ☆26 · Jun 12, 2023 · Updated 2 years ago
- Integration of Iceberg table management into Spark SQL ☆11 · Jan 21, 2020 · Updated 6 years ago
- Argument parsing in Scala ☆84 · Mar 27, 2023 · Updated 3 years ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package ☆10 · Sep 9, 2015 · Updated 10 years ago
- ☆16 · Jun 9, 2016 · Updated 9 years ago
- Import and export TensorFlow records from/to Spark ☆18 · Jul 7, 2017 · Updated 8 years ago
- Legoo: A collection of automation modules to build analytics infrastructure ☆20 · Jul 24, 2020 · Updated 5 years ago
- Hortonworks Data Platform Data Generation Tool ☆13 · Nov 30, 2017 · Updated 8 years ago
- High performance HBase / Spark SQL engine ☆28 · Jul 7, 2022 · Updated 3 years ago
- Fast-Data-Processing-with-Spark-2 ☆22 · Jan 18, 2023 · Updated 3 years ago
- Set of ETL utils for Spark ☆15 · May 4, 2020 · Updated 5 years ago
- A library for querying Binlog with Apache Spark Structured Streaming, for Spark SQL, DataFrames and [MLSQL](https://www.mlsql.tech). ☆153 · Apr 21, 2023 · Updated 2 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table ☆30 · Apr 16, 2018 · Updated 7 years ago
- This repository contains the development code for sparkMeasure, an Apache Spark performance analysis and troubleshooting library. It simp… ☆818 · Apr 1, 2026 · Updated last week
- Spark data source for Cognite Data Fusion ☆23 · Mar 27, 2026 · Updated last week
- A library for financial and time series calculations on Apache Spark ☆28 · Feb 2, 2016 · Updated 10 years ago
- ☆11 · Dec 10, 2015 · Updated 10 years ago
- Docker images for building packages with clean dependencies in multiple distributions. ☆10 · Jan 25, 2018 · Updated 8 years ago
- ☆12 · May 30, 2017 · Updated 8 years ago
- Spark code to analyze HBase Snapshots ☆35 · Feb 19, 2018 · Updated 8 years ago
- A Java library for working with Table Schema. ☆27 · Nov 24, 2025 · Updated 4 months ago
- Big Data Development Kit (Hadoop / Spark / Zeppelin / IntelliJ) ☆22 · Feb 9, 2016 · Updated 10 years ago
- Vagrant, Apache Spark and Apache Zeppelin VM for teaching ☆44 · Oct 19, 2017 · Updated 8 years ago
- EasyXMS is a simple system written in Java for batch management of Linux/Unix servers, with features such as multithreaded batch command execution and multithreaded batch file upload. ☆21 · Feb 8, 2015 · Updated 11 years ago
- A trivial project illustrating the usage of a bean, which can be configured to invoke stored procedures, as a Camel component. ☆12 · Mar 31, 2015 · Updated 11 years ago
- A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster. ☆47 · Aug 1, 2016 · Updated 9 years ago
- Ralph is a service discovery for twemproxy on mesos ☆24 · Mar 1, 2015 · Updated 11 years ago
- Code and setup information for Introduction to Machine Learning with Spark ☆12 · Sep 4, 2015 · Updated 10 years ago
- Big Data Toolkit for the JVM ☆148 · Nov 4, 2020 · Updated 5 years ago
- Workshop for Hadoop Operations Best Practices ☆10 · Feb 24, 2015 · Updated 11 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark ☆51 · Feb 21, 2014 · Updated 12 years ago