BlazingDB / blazingsqlLinks
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
☆1,987Updated 3 years ago
Alternatives and similar repositories for blazingsql
Users that are interested in blazingsql are comparing it to the libraries listed below
Sorting:
- A GPU-powered real-time analytics storage and query engine.☆3,066Updated last year
- HeavyDB (formerly MapD/OmniSciDB)☆3,028Updated 3 weeks ago
- Dremio - the missing link in modern data☆1,442Updated last week
- ZetaSQL - Analyzer Framework for SQL☆2,421Updated 2 weeks ago
- The Universal Storage Engine☆1,987Updated this week
- Distributed Computing for AI Made Simple☆1,048Updated 2 years ago
- Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.☆2,745Updated last year
- Vectorized processing for Apache Arrow☆485Updated 3 years ago
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…☆1,859Updated 3 weeks ago
- A composable and fully extensible C++ execution engine library for data management systems.☆3,906Updated this week
- Hopsworks - Data-Intensive AI platform with a Feature Store☆1,257Updated 7 months ago
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,391Updated this week
- vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)☆926Updated 2 months ago
- Apache Drill is a distributed MPP query layer for self describing data☆1,991Updated 3 weeks ago
- Spark RAPIDS plugin - accelerate Apache Spark with GPUs☆932Updated this week
- A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Doc…☆2,522Updated last year
- Parsing and analysis of Vertica, Hive, and Presto SQL.☆1,076Updated 3 years ago
- A uniform interface to run deep learning models from multiple frameworks☆940Updated last year
- Apache Parquet Format☆2,055Updated last week
- SQL-based streaming analytics platform at scale☆1,227Updated 5 years ago
- Tuplex is a parallel big data processing framework that runs data science pipelines written in Python at the speed of compiled code. Tupl…☆813Updated last month
- Apache Parquet Java☆2,952Updated last week
- Apache Impala☆1,243Updated this week
- A Redis module for serving tensors and executing deep learning graphs☆839Updated last month
- Real-time Data Integration and Transformation: use SQL to transform, deliver, and act on fast-changing data.☆6,128Updated this week
- Mirror of Apache Kudu☆1,888Updated this week
- Apache DataFusion Ballista Distributed Query Engine☆1,851Updated last week
- ☆1,675Updated 2 weeks ago
- Making data lake work for time series☆1,183Updated last year
- PG-Strom - Master development repository☆1,354Updated this week