BlazingDB / blazingsqlLinks
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
☆1,991Updated 3 years ago
Alternatives and similar repositories for blazingsql
Users that are interested in blazingsql are comparing it to the libraries listed below
Sorting:
- A GPU-powered real-time analytics storage and query engine.☆3,069Updated last year
- HeavyDB (formerly MapD/OmniSciDB)☆3,040Updated last month
- High-performance runtime for data analytics applications☆3,003Updated 3 years ago
- Distributed Computing for AI Made Simple☆1,047Updated 2 years ago
- Dremio - the missing link in modern data☆1,453Updated 2 months ago
- The Universal Storage Engine☆1,999Updated this week
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…☆1,867Updated last month
- Vectorized processing for Apache Arrow☆485Updated 3 years ago
- ZetaSQL - Analyzer Framework for SQL☆2,434Updated this week
- A Redis module for serving tensors and executing deep learning graphs☆842Updated 3 months ago
- Spark RAPIDS plugin - accelerate Apache Spark with GPUs☆950Updated this week
- A uniform interface to run deep learning models from multiple frameworks☆941Updated last year
- Hopsworks - Data-Intensive AI platform with a Feature Store☆1,260Updated 9 months ago
- Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.☆2,750Updated last year
- Apache Parquet Format☆2,125Updated last week
- A composable and fully extensible C++ execution engine library for data management systems.☆3,967Updated this week
- Apache Drill is a distributed MPP query layer for self describing data☆1,998Updated 3 weeks ago
- Mirror of Apache Kudu☆1,892Updated this week
- vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)☆933Updated last week
- Distributed SQL Engine in Python using Dask☆408Updated last year
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.☆301Updated last year
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,436Updated this week
- Tuplex is a parallel big data processing framework that runs data science pipelines written in Python at the speed of compiled code. Tupl…☆817Updated 3 months ago
- PG-Strom - Master development repository☆1,377Updated 2 weeks ago
- Apache Impala☆1,254Updated this week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,123Updated 3 weeks ago
- Apache ORC - the smallest, fastest columnar storage for Hadoop workloads☆750Updated 2 weeks ago
- Parsing and analysis of Vertica, Hive, and Presto SQL.☆1,077Updated 3 years ago
- Mirror of Apache MADlib☆468Updated last month
- The live data layer for apps and AI agents Create up-to-the-second views into your business, just using SQL☆6,181Updated this week