Parquet-based ML data format optimized for working with unstructured data
☆141Jan 5, 2023Updated 3 years ago
Alternatives and similar repositories for rikai
Users that are interested in rikai are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- JupyterLab extensions developed by Tubi including nteract data explorer, shareable link and deep copy/cut/paste☆19Jan 5, 2023Updated 3 years ago
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…☆6,310Updated this week
- 📙 Notebooks Academy: Write Production-Ready Code From Jupyter.☆13Jan 5, 2023Updated 3 years ago
- Amundsen Gremlin☆22Aug 26, 2022Updated 3 years ago
- ☆21Apr 21, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Yet another task management flow.☆14May 17, 2019Updated 6 years ago
- DataFuse operator manages fuse-query and fuse-store clusters atop Kubernetes using CRDs.☆13Jul 4, 2022Updated 3 years ago
- Demo repository to lambda-fy your dbt runs☆11Sep 7, 2023Updated 2 years ago
- Jupyter notebooks containing time series analysis demos☆18Mar 30, 2026Updated 2 weeks ago
- On top of SemanticUI, this Scala.js project provides components defined in Ant Design with Binding.scala☆15Jan 1, 2019Updated 7 years ago
- Cache server :)☆32Sep 5, 2023Updated 2 years ago
- Helm charts for databend☆19Aug 1, 2025Updated 8 months ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆16Jan 4, 2026Updated 3 months ago
- Libs / Themes for elvish☆18Mar 27, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.☆37Jan 3, 2023Updated 3 years ago
- Flink dynamic CEP demo☆20Mar 22, 2022Updated 4 years ago
- Package to download long-term Google Trends☆15Jul 19, 2023Updated 2 years ago
- Automated Jupyter notebook testing. 📙☆41Jan 25, 2024Updated 2 years ago
- write WeApp with scalajs☆19Dec 31, 2018Updated 7 years ago
- a hyper-optimized single-node(local) version of spark sql engine, which's fundamental data structure is scala Iterator rather than RDD.☆13Jun 13, 2023Updated 2 years ago
- This library is an ongoing effort towards bringing the data exchanging ability between Java/Scala and Python. PyJava introduces Apache A…☆49Apr 21, 2023Updated 2 years ago
- Tantivy directory implementation backed by object_store☆40Jan 22, 2024Updated 2 years ago
- ☆10Nov 11, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Pushdown cache for DataFusion☆396Apr 3, 2026Updated 2 weeks ago
- Helpers for setting up an embedded Python interpreter☆19Oct 31, 2025Updated 5 months ago
- Single Cell Multiplexed Imaging Jupyter Voila Dashboard using CODEX Data☆17Jul 6, 2023Updated 2 years ago
- docker scripts to build and run a minimal version of TDengine☆10Jul 17, 2019Updated 6 years ago
- ☆12Mar 12, 2021Updated 5 years ago
- A slab allocator with stable references☆15Jan 23, 2023Updated 3 years ago
- Serve a 1x1 GIF pixel from an AWS lambda-powered endpoint☆13Sep 7, 2017Updated 8 years ago
- sql解析和执行,能够执行hive, spark, flink, 以及对应对TensorFlow, Deeplearning4j的算法SQL执行☆11Sep 16, 2022Updated 3 years ago
- Quick & Dirty cli to process mysql dumps☆10Sep 30, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Golang driver for databend cloud☆20Mar 23, 2026Updated 3 weeks ago
- ☆21Jul 17, 2023Updated 2 years ago
- AWS Blog post code for running feature-extraction on images using AWS Batch and Cloud Development Kit (CDK).☆20Oct 28, 2022Updated 3 years ago
- An embeddable graph database for large-scale vertices and edges☆74Apr 16, 2023Updated 3 years ago
- Plugin to accelerate Spark SQL with the NEC Vector Engine.☆19Aug 15, 2022Updated 3 years ago
- TPCH benchmark tool for databend☆11Nov 15, 2022Updated 3 years ago
- The Databend plugin for dbt (data build tool)☆12Mar 17, 2023Updated 3 years ago