eto-ai / rikaiView external linksLinks
Parquet-based ML data format optimized for working with unstructured data
☆141Jan 5, 2023Updated 3 years ago
Alternatives and similar repositories for rikai
Users that are interested in rikai are comparing it to the libraries listed below
Sorting:
- Liga: Let Data Dance with ML Models☆13Sep 12, 2023Updated 2 years ago
- Processing videos on Apache Spark☆12Feb 14, 2022Updated 4 years ago
- DataFuse operator manages fuse-query and fuse-store clusters atop Kubernetes using CRDs.☆13Jul 4, 2022Updated 3 years ago
- Helm charts for databend☆19Aug 1, 2025Updated 6 months ago
- Cache server :)☆32Sep 5, 2023Updated 2 years ago
- Flink dynamic CEP demo☆19Mar 22, 2022Updated 3 years ago
- On top of SemanticUI, this Scala.js project provides components defined in Ant Design with Binding.scala☆15Jan 1, 2019Updated 7 years ago
- ☆21Apr 21, 2023Updated 2 years ago
- write WeApp with scalajs☆19Dec 31, 2018Updated 7 years ago
- Demo repository to lambda-fy your dbt runs☆11Sep 7, 2023Updated 2 years ago
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.☆37Jan 3, 2023Updated 3 years ago
- docker scripts to build and run a minimal version of TDengine☆10Jul 17, 2019Updated 6 years ago
- Client libraries of end users of Apache Kyuubi☆11Jan 10, 2023Updated 3 years ago
- Python and Scala APIs for enhanced Spark analytics☆12Mar 15, 2017Updated 8 years ago
- This library is an ongoing effort towards bringing the data exchanging ability between Java/Scala and Python. PyJava introduces Apache A…☆49Apr 21, 2023Updated 2 years ago
- TPCH benchmark tool for databend☆11Nov 15, 2022Updated 3 years ago
- Demo for service oriented application hosted on Hadoop YARN cluster for HA and scheduling☆23Apr 2, 2018Updated 7 years ago
- Clink is a library that provides APIs and infrastructure to facilitate the development of parallelizable feature engineering operators th…☆30Feb 21, 2022Updated 3 years ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆17Jan 4, 2026Updated last month
- The Databend plugin for dbt (data build tool)☆12Mar 17, 2023Updated 2 years ago
- Showing the relationship between ImageNet ID and labels and pytorch pre-trained model output ID and labels☆10Oct 11, 2020Updated 5 years ago
- ☆13Dec 11, 2024Updated last year
- Distributed SQL base Realtime Streaming Computation Framework On Apache Storm, Spark☆12Mar 14, 2016Updated 9 years ago
- Run Github Actions workflows locally or on a custom backend☆17Mar 17, 2025Updated 10 months ago
- 通过观看尚硅谷的Flink实战视频,开了一个仓库,记录源码和一些所需要的数据文件,也欢迎大家积极讨论☆17Mar 1, 2021Updated 4 years ago
- Examples demonstrating how to use Amazon S3 Inventory to analyze your S3 storage using Spark and EMR.☆20Mar 4, 2020Updated 5 years ago
- a hyper-optimized single-node(local) version of spark sql engine, which's fundamental data structure is scala Iterator rather than RDD.☆13Jun 13, 2023Updated 2 years ago
- Single Cell Multiplexed Imaging Jupyter Voila Dashboard using CODEX Data☆17Jul 6, 2023Updated 2 years ago
- Yet another task management flow.☆14May 17, 2019Updated 6 years ago
- Examples of using SparklingPandas and Pandas with PySpark☆16Aug 6, 2015Updated 10 years ago
- Implementation of S3-FIFO cache algorithm☆16Aug 30, 2023Updated 2 years ago
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.☆258Feb 21, 2023Updated 2 years ago
- OSPP 2022 Project: String Adaptive Hash Table for Databend☆19Sep 15, 2022Updated 3 years ago
- 大数据【企业级360°全方位用户画像】标签开发部分源码☆19Dec 18, 2020Updated 5 years ago
- ☆23Feb 9, 2026Updated last week
- China Scala User Group☆38Oct 15, 2018Updated 7 years ago
- 基于知识图谱的软件工程学科在线学习平台☆19Jun 5, 2020Updated 5 years ago
- An example project that combines Spark Streaming, Kafka, and Parquet to transform JSON objects streamed over Kafka into Parquet files in …☆19Jun 22, 2021Updated 4 years ago
- GIS extension for SparkSQL☆39Jan 25, 2016Updated 10 years ago