BlazingDB / blazingsqlLinks
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
☆1,974Updated 2 years ago
Alternatives and similar repositories for blazingsql
Users that are interested in blazingsql are comparing it to the libraries listed below
Sorting:
- A GPU-powered real-time analytics storage and query engine.☆3,052Updated 10 months ago
- A composable and fully extensible C++ execution engine library for data management systems.☆3,764Updated this week
- HeavyDB (formerly OmniSciDB)☆3,003Updated last week
- Dremio - the missing link in modern data☆1,431Updated last month
- The Universal Storage Engine☆1,943Updated this week
- Distributed Computing for AI Made Simple☆1,043Updated 2 years ago
- Vectorized processing for Apache Arrow☆485Updated 3 years ago
- Data-Centric Pipelines and Data Versioning☆6,228Updated 4 months ago
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,326Updated this week
- Spark RAPIDS plugin - accelerate Apache Spark with GPUs☆903Updated this week
- SQL-based streaming analytics platform at scale☆1,224Updated 4 years ago
- Apache Parquet Format☆1,958Updated this week
- Hopsworks - Data-Intensive AI platform with a Feature Store☆1,226Updated 3 months ago
- Distributed SQL Engine in Python using Dask☆405Updated 9 months ago
- ☆1,654Updated 3 weeks ago
- ZetaSQL - Analyzer Framework for SQL☆2,400Updated 2 months ago
- A uniform interface to run deep learning models from multiple frameworks☆934Updated last year
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,217Updated this week
- Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.☆2,732Updated last year
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…☆1,839Updated last year
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆8,051Updated last week
- A distributed knowledge graph store☆1,654Updated 5 years ago
- Distributed query engine providing simple and reliable data processing for any modality and scale☆2,873Updated this week
- Apache Parquet Java☆2,830Updated this week
- Apache Drill is a distributed MPP query layer for self describing data☆1,975Updated last week
- Apache Iceberg☆7,535Updated this week
- cuDF - GPU DataFrame Library☆8,961Updated this week
- Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, v…☆4,720Updated this week
- Mirror of Apache MADlib☆466Updated last year
- Mirror of Apache Kudu☆1,872Updated this week