Cylon is a fast, scalable, distributed-memory, parallel runtime with a Pandas-like DataFrame.
☆302 · Mar 12, 2026 · Updated 2 months ago
Alternatives and similar repositories for cylon
Users that are interested in cylon are comparing it to the libraries listed below.
- A composable framework for fast and scalable data analytics ☆58 · Dec 12, 2022 · Updated 3 years ago
- A Python library to run analytics workloads with the performance of Rust, the flexibility of Python and O(1) cost in moving data between … ☆61 · May 6, 2021 · Updated 5 years ago
- Brushing and linking for big data ☆973 · Dec 2, 2025 · Updated 5 months ago
- Vectorized processing for Apache Arrow ☆484 · Feb 14, 2022 · Updated 4 years ago
- Pandas ExtensionDType/Array backed by Apache Arrow ☆232 · Feb 22, 2023 · Updated 3 years ago
- ☆109 · Jul 5, 2023 · Updated 2 years ago
- In-memory, columnar, arrow-based database. ☆47 · Sep 6, 2022 · Updated 3 years ago
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients ☆37 · Mar 9, 2021 · Updated 5 years ago
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans. ☆1,503 · May 12, 2026 · Updated last week
- Tuplex is a parallel big data processing framework that runs data science pipelines written in Python at the speed of compiled code. Tupl… ☆814 · Aug 10, 2025 · Updated 9 months ago
- 🌳 A compressed rank/select dictionary exploiting approximate linearity and repetitiveness. ☆15 · Jun 28, 2022 · Updated 3 years ago
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with … ☆658 · May 5, 2026 · Updated 2 weeks ago
- High performance model preprocessing library on PyTorch ☆644 · Mar 29, 2024 · Updated 2 years ago
- Vinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics. ☆65 · May 15, 2021 · Updated 5 years ago
- The Universal Storage Engine ☆2,056 · Apr 23, 2026 · Updated 3 weeks ago
- The stupidest database of all time. ☆56 · Mar 27, 2026 · Updated last month
- BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF. ☆2,011 · Sep 16, 2022 · Updated 3 years ago
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…