BlazingDB / blazingsqlLinks
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
☆1,996Updated 3 years ago
Alternatives and similar repositories for blazingsql
Users that are interested in blazingsql are comparing it to the libraries listed below
Sorting:
- HeavyDB (formerly MapD/OmniSciDB)☆3,046Updated 2 months ago
- Distributed Computing for AI Made Simple☆1,047Updated 2 years ago
- Dremio - the missing link in modern data☆1,453Updated 2 months ago
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…☆1,869Updated this week
- The Universal Storage Engine☆2,003Updated last week
- Vectorized processing for Apache Arrow☆485Updated 3 years ago
- ZetaSQL - Analyzer Framework for SQL☆2,444Updated last week
- Hopsworks - Data-Intensive AI platform with a Feature Store☆1,268Updated 10 months ago
- A Redis module for serving tensors and executing deep learning graphs☆841Updated 4 months ago
- Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.☆2,751Updated last year
- A low-latency prediction-serving system☆1,421Updated 4 years ago
- A uniform interface to run deep learning models from multiple frameworks☆941Updated last year
- Spark RAPIDS plugin - accelerate Apache Spark with GPUs☆953Updated this week
- Distributed SQL Engine in Python using Dask☆408Updated last year
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,441Updated last week
- A composable and fully extensible C++ execution engine library for data management systems.☆3,983Updated last week
- Parsing and analysis of Vertica, Hive, and Presto SQL.☆1,078Updated 3 years ago
- Tuplex is a parallel big data processing framework that runs data science pipelines written in Python at the speed of compiled code. Tupl…☆817Updated 4 months ago
- Apache Parquet Format☆2,143Updated last week
- PG-Strom - Master development repository☆1,380Updated this week
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.☆301Updated last year
- Apache Parquet Java☆3,002Updated last week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,128Updated 2 weeks ago
- Mirror of Apache MADlib☆468Updated last month
- Redis module that provides a completely functional SQL database☆1,548Updated 4 years ago
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.☆354Updated last week
- The live data layer for apps and AI agents Create up-to-the-second views into your business, just using SQL☆6,189Updated this week
- Apache Drill is a distributed MPP query layer for self describing data☆2,001Updated last month
- Mirror of Apache Kudu☆1,894Updated last week
- Apache Impala☆1,257Updated this week