skyplane-project / skyplaneLinks
π₯ Blazing fast bulk data transfers between any cloud π₯
β1,162Updated last year
Alternatives and similar repositories for skyplane
Users that are interested in skyplane are comparing it to the libraries listed below
Sorting:
- A multi-cloud framework for big data analytics and embarrassingly parallel jobs, that provides an universal API for building parallel appβ¦β347Updated last week
- Aqueduct is no longer being maintained. Aqueduct allows you to run LLM and ML workloads on any cloud infrastructure.β520Updated 2 years ago
- Making data lake work for time seriesβ1,183Updated last year
- A simple, high-throughput file client for mounting an Amazon S3 bucket as a local file system.β5,392Updated this week
- Cloud Infrastructure as data in PostgreSQLβ602Updated 7 months ago
- New file format for storage of large columnar datasets.β609Updated this week
- Module to Automatically maximize the utilization of GPU resources in a Kubernetes cluster through real-time dynamic partitioning and elasβ¦β669Updated last year
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to β¦β240Updated this week
- Distribute and run AI workloads magically in Python, like PyTorch for ML infra.β1,045Updated 2 months ago
- Python Stream Processingβ1,813Updated 5 months ago
- Parallel S3 and local filesystem execution tool.β3,597Updated 3 months ago
- Open Control Plane for Tables in Data Lakehouseβ370Updated last week
- An open-source ML pipeline development platformβ996Updated 8 months ago
- dstack is an open-source control plane for running development, training, and inference jobs on GPUsβacross hyperscalers, neoclouds, or oβ¦β1,897Updated this week
- A stateful serverless platformβ242Updated 2 years ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewβ¦β2,110Updated 5 months ago
- βοΈ Terraform plugin for machine learning workloads: spot instance recovery & auto-termination | AWS, GCP, Azure, Kubernetesβ294Updated 9 months ago
- Compressed Log Processor (CLP) is a free log management tool capable of compressing logs and searching the compressed logs without decompβ¦β997Updated this week
- Chronon is a data platform for serving for AI/ML applications.β899Updated last week
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scaleβ815Updated this week
- Distributed query engine providing simple and reliable data processing for any modality and scaleβ4,480Updated this week
- The Amazon S3 Connector for PyTorch delivers high throughput for PyTorch training jobs that access and store data in Amazon S3.β178Updated this week
- Multy - Easily deploy multi cloud infrastructure. Write cloud-agnostic config deployed across multiple cloudsβ657Updated 2 years ago
- π Continuously synchronize the systems where your data lives, to the systems where you _want_ it to live, with Estuary Flow. πβ816Updated this week
- Postgres-native columnar storage extensionβ2,988Updated 7 months ago
- Klotho - write AWS applications at lightning speedβ1,146Updated last year
- This is RonDB, a distribution of NDB Cluster developed and used by Hopsworks AB. It also contains development branches of RonDB.β668Updated this week
- GlareDB: A light and fast SQL database for analyticsβ971Updated last week
- Database replication platform that leverages change data capture. Stream production data from databases to your data warehouse (Snowflakeβ¦β673Updated this week
- Prepare requirements and deploy Flyte using Helmβ79Updated 5 months ago