skyplane-project / skyplaneLinks
π₯ Blazing fast bulk data transfers between any cloud π₯
β1,150Updated last year
Alternatives and similar repositories for skyplane
Users that are interested in skyplane are comparing it to the libraries listed below
Sorting:
- A multi-cloud framework for big data analytics and embarrassingly parallel jobs, that provides an universal API for building parallel appβ¦β341Updated this week
- Cloud Infrastructure as data in PostgreSQLβ601Updated 6 months ago
- Unified Interface for Constructing and Managing Workflows on different workflow engines, such as Argo Workflows, Tekton Pipelines, and Apβ¦β940Updated 9 months ago
- Aqueduct is no longer being maintained. Aqueduct allows you to run LLM and ML workloads on any cloud infrastructure.β521Updated 2 years ago
- A simple, high-throughput file client for mounting an Amazon S3 bucket as a local file system.β5,303Updated this week
- An open-source ML pipeline development platformβ995Updated 6 months ago
- Database replication platform that leverages change data capture. Stream production data from databases to your data warehouse (Snowflakeβ¦β662Updated this week
- Parallel S3 and local filesystem execution tool.β3,474Updated last month
- Chronon is a data platform for serving for AI/ML applications.β827Updated this week
- Distributed query engine providing simple and reliable data processing for any modality and scaleβ3,161Updated this week
- Hera makes Python code easy to orchestrate on Argo Workflows through native Python integrations. It lets you construct and submit your Woβ¦β756Updated this week
- Making data lake work for time seriesβ1,179Updated 11 months ago
- InfiniCache: A cost-effective memory cache that is built atop ephemeral serverless functions (USENIX FAST'20)β254Updated 2 years ago
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scaleβ707Updated this week
- Multy - Easily deploy multi cloud infrastructure. Write cloud-agnostic config deployed across multiple cloudsβ654Updated 2 years ago
- Klotho - write AWS applications at lightning speedβ1,147Updated 11 months ago
- Module to Automatically maximize the utilization of GPU resources in a Kubernetes cluster through real-time dynamic partitioning and elasβ¦β666Updated last year
- π Continuously synchronize the systems where your data lives, to the systems where you _want_ it to live, with Estuary Flow. πβ778Updated this week
- A toolkit to run Ray applications on Kubernetesβ1,921Updated this week
- Open Control Plane for Tables in Data Lakehouseβ362Updated this week
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to β¦β233Updated this week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewβ¦β2,101Updated 4 months ago
- New file format for storage of large columnar datasets.β577Updated this week
- Measure Amazon S3's performance from any location.β865Updated last year
- Prepare requirements and deploy Flyte using Helmβ73Updated 3 months ago
- lakeFS - Data version control for your data lake | Git for dataβ4,794Updated this week
- DoEKS is a tool to build, deploy and scale Data Platforms on Amazon EKSβ770Updated this week
- A multi-cluster batch queuing system for high-throughput workloads on Kubernetes.β538Updated this week
- The Amazon S3 Connector for PyTorch delivers high throughput for PyTorch training jobs that access and store data in Amazon S3.β172Updated this week
- A stateful serverless platformβ242Updated 2 years ago