BlazingDB / blazingsql
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
☆1,954Updated 2 years ago
Alternatives and similar repositories for blazingsql:
Users that are interested in blazingsql are comparing it to the libraries listed below
- A GPU-powered real-time analytics storage and query engine.☆3,045Updated 8 months ago
- High-performance runtime for data analytics applications☆2,997Updated 2 years ago
- HeavyDB (formerly OmniSciDB)☆2,973Updated 6 months ago
- The Universal Storage Engine☆1,909Updated last week
- ZetaSQL - Analyzer Framework for SQL☆2,373Updated 4 months ago
- Distributed Computing for AI Made Simple☆1,042Updated 2 years ago
- A new arguably faster implementation of Apache Spark from scratch in Rust☆2,231Updated 2 years ago
- Dremio - the missing link in modern data☆1,415Updated 4 months ago
- A composable and fully extensible C++ execution engine library for data management systems.☆3,654Updated this week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,274Updated last week
- A uniform interface to run deep learning models from multiple frameworks☆936Updated last year
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…☆1,822Updated last year
- Mirror of Apache MADlib☆465Updated 10 months ago
- Distributed SQL Engine in Python using Dask☆400Updated 6 months ago
- A better notebook for Scala (and more)☆4,549Updated last week
- Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.☆2,714Updated last year
- A multi-model machine learning feature embedding database☆637Updated 5 years ago
- Apache Parquet Format☆1,903Updated 2 weeks ago
- Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark☆1,496Updated 3 months ago
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.☆299Updated 9 months ago
- Quilt is a data mesh for connecting people with actionable data☆1,331Updated this week
- Apache DataFusion SQL Query Engine☆6,887Updated this week
- Spark RAPIDS plugin - accelerate Apache Spark with GPUs☆869Updated this week
- A distributed task scheduler for Dask☆1,609Updated this week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,051Updated 5 months ago
- Apache Parquet Java☆2,755Updated this week
- Self-Driving Database Management System from Carnegie Mellon University☆1,746Updated 2 years ago
- Parsing and analysis of Vertica, Hive, and Presto SQL.☆1,079Updated 3 years ago
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆7,876Updated this week
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle☆3,613Updated 2 weeks ago