BlazingDB / blazingsqlLinks
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
☆1,985Updated 2 years ago
Alternatives and similar repositories for blazingsql
Users that are interested in blazingsql are comparing it to the libraries listed below
Sorting:
- HeavyDB (formerly MapD/OmniSciDB)☆3,026Updated 2 months ago
- A GPU-powered real-time analytics storage and query engine.☆3,065Updated last year
- High-performance runtime for data analytics applications☆3,002Updated 3 years ago
- The Universal Storage Engine☆1,978Updated last week
- Dremio - the missing link in modern data☆1,440Updated 4 months ago
- Distributed Computing for AI Made Simple☆1,046Updated 2 years ago
- Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.☆2,740Updated last year
- Spark RAPIDS plugin - accelerate Apache Spark with GPUs☆928Updated this week
- Vectorized processing for Apache Arrow☆485Updated 3 years ago
- A new arguably faster implementation of Apache Spark from scratch in Rust☆2,240Updated 3 years ago
- A Redis module for serving tensors and executing deep learning graphs☆839Updated 2 weeks ago
- A composable and fully extensible C++ execution engine library for data management systems.☆3,874Updated this week
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…☆1,853Updated 3 weeks ago
- ZetaSQL - Analyzer Framework for SQL☆2,420Updated 2 months ago
- Hopsworks - Data-Intensive AI platform with a Feature Store☆1,249Updated 6 months ago
- Apache Parquet Format☆2,027Updated last week
- vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)☆922Updated last month
- A low-latency prediction-serving system☆1,420Updated 4 years ago
- Parsing and analysis of Vertica, Hive, and Presto SQL.☆1,076Updated 3 years ago
- A uniform interface to run deep learning models from multiple frameworks☆939Updated last year
- SQL-based streaming analytics platform at scale☆1,227Updated 5 years ago
- Blazingly fast analytics database that will rapidly devour all of your data.☆1,637Updated 2 months ago
- Distributed SQL Engine in Python using Dask☆407Updated last year
- Apache Drill is a distributed MPP query layer for self describing data☆1,989Updated this week
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.☆346Updated last month
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,375Updated 3 weeks ago
- Apache Parquet Java☆2,931Updated this week
- TonY is a framework to natively run deep learning frameworks on Apache Hadoop.☆709Updated last year
- Tuplex is a parallel big data processing framework that runs data science pipelines written in Python at the speed of compiled code. Tupl…☆814Updated 3 weeks ago
- Mirror of Apache Kudu☆1,888Updated this week