BlazingDB / blazingsql
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
☆1,960Updated 2 years ago
Alternatives and similar repositories for blazingsql:
Users that are interested in blazingsql are comparing it to the libraries listed below
- A GPU-powered real-time analytics storage and query engine.☆3,047Updated 9 months ago
- HeavyDB (formerly OmniSciDB)☆2,991Updated 7 months ago
- The Universal Storage Engine☆1,928Updated this week
- Apache Parquet Format☆1,933Updated last week
- Vectorized processing for Apache Arrow☆485Updated 3 years ago
- A composable and fully extensible C++ execution engine library for data management systems.☆3,707Updated this week
- Apache Drill is a distributed MPP query layer for self describing data☆1,964Updated 2 weeks ago
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…☆1,832Updated last year
- Distributed Computing for AI Made Simple☆1,045Updated 2 years ago
- Apache Parquet Java☆2,788Updated last week
- ☆1,645Updated this week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,289Updated this week
- Mirror of Apache Kudu☆1,869Updated this week
- Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, v…☆4,543Updated this week
- Distributed SQL Engine in Python using Dask☆402Updated 7 months ago
- A uniform interface to run deep learning models from multiple frameworks☆935Updated last year
- Parsing and analysis of Vertica, Hive, and Presto SQL.☆1,080Updated 3 years ago
- Dremio - the missing link in modern data☆1,423Updated 6 months ago
- ZetaSQL - Analyzer Framework for SQL☆2,386Updated 3 weeks ago
- Apache ORC - the smallest, fastest columnar storage for Hadoop workloads☆723Updated this week
- cuDF - GPU DataFrame Library☆8,876Updated this week
- Real-time Data Integration and Transformation: use SQL to transform, deliver, and act on fast-changing data.☆5,959Updated this week
- Real-time Query for Hadoop; mirror of Apache Impala☆34Updated 2 years ago
- Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.☆2,722Updated last year
- Spark RAPIDS plugin - accelerate Apache Spark with GPUs☆885Updated this week
- A distributed knowledge graph store☆1,654Updated 5 years ago
- SQL-based streaming analytics platform at scale☆1,223Updated 4 years ago
- Apache Impala☆1,204Updated this week
- A better notebook for Scala (and more)☆4,557Updated last month
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆7,975Updated this week