BlazingDB / blazingsql
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
☆1,965Updated 2 years ago
Alternatives and similar repositories for blazingsql
Users that are interested in blazingsql are comparing it to the libraries listed below
Sorting:
- HeavyDB (formerly OmniSciDB)☆2,996Updated 8 months ago
- ZetaSQL - Analyzer Framework for SQL☆2,396Updated last month
- A composable and fully extensible C++ execution engine library for data management systems.☆3,739Updated this week
- The Universal Storage Engine☆1,932Updated this week
- Hopsworks - Data-Intensive AI platform with a Feature Store☆1,223Updated 3 months ago
- Apache Parquet Format☆1,943Updated this week
- PG-Strom - Master development repository☆1,330Updated this week
- Apache Parquet Java☆2,812Updated this week
- Mirror of Apache Kudu☆1,870Updated this week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,314Updated last week
- Vectorized processing for Apache Arrow☆485Updated 3 years ago
- Dremio - the missing link in modern data☆1,429Updated 2 weeks ago
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…☆1,836Updated last year
- Mirror of Apache MADlib☆466Updated last year
- Parsing and analysis of Vertica, Hive, and Presto SQL.☆1,080Updated 3 years ago
- A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow☆2,082Updated last year
- vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)☆895Updated this week
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics☆15,415Updated this week
- Distributed Computing for AI Made Simple☆1,044Updated 2 years ago
- ☆1,647Updated this week
- Real-time Data Integration and Transformation: use SQL to transform, deliver, and act on fast-changing data.☆5,979Updated this week
- Distributed storage for sequential data☆1,903Updated 3 years ago
- A multi-model machine learning feature embedding database☆638Updated 5 years ago
- Spark RAPIDS plugin - accelerate Apache Spark with GPUs☆892Updated this week
- cuDF - GPU DataFrame Library☆8,917Updated this week
- The Open Source Feature Store for AI/ML☆6,056Updated this week
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle☆3,635Updated 3 weeks ago
- Apache Impala☆1,209Updated this week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,081Updated last month
- SQL-based streaming analytics platform at scale☆1,224Updated 4 years ago