eto-ai/rikai

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/eto-ai/rikai)

eto-ai / rikai

Parquet-based ML data format optimized for working with unstructured data

☆140

Alternatives and similar repositories for rikai

Users that are interested in rikai are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

eto-ai / spark-video
View on GitHub
Processing videos on Apache Spark
☆13Feb 14, 2022Updated 4 years ago
komprenilo / liga
View on GitHub
Liga: Let Data Dance with ML Models
☆13Sep 12, 2023Updated 2 years ago
lance-format / lance
View on GitHub
Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…
☆6,850Updated this week
redink / task_flow
View on GitHub
Yet another task management flow.
☆14May 17, 2019Updated 7 years ago
databendlabs / datafuse-operator
View on GitHub
DataFuse operator manages fuse-query and fuse-store clusters atop Kubernetes using CRDs.
☆13Jul 4, 2022Updated 4 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
da-liii / Binding-SemanticUI
View on GitHub
On top of SemanticUI, this Scala.js project provides components defined in Ant Design with Binding.scala
☆15Jan 1, 2019Updated 7 years ago
trouze / modal-dbt
View on GitHub
Demo repository to lambda-fy your dbt runs
☆11Sep 7, 2023Updated 2 years ago
databendlabs / opencache
View on GitHub
Cache server :)
☆32Sep 5, 2023Updated 2 years ago
apache / kyuubi-client
View on GitHub
Client libraries of end users of Apache Kyuubi
☆11May 15, 2026Updated 2 months ago
oap-project / sql-ds-cache
View on GitHub
Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.
☆37Jan 3, 2023Updated 3 years ago
Tubitv / logger_sentry
View on GitHub
Elixir Logger backend for Sentry
☆26Jun 30, 2025Updated last year
muesli / elvish-libs
View on GitHub
Libs / Themes for elvish
☆18Mar 27, 2024Updated 2 years ago
databendlabs / helm-charts
View on GitHub
Helm charts for databend
☆19May 15, 2026Updated 2 months ago
apache / kyuubi-docker
View on GitHub
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
☆16May 22, 2026Updated 2 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
getindata / flink-dynamic-cep-demo
View on GitHub
Flink dynamic CEP demo
☆20Mar 22, 2022Updated 4 years ago
kaiwu / weui-scalajs
View on GitHub
write WeApp with scalajs
☆19Dec 31, 2018Updated 7 years ago
direct-spark-sql / direct-spark-sql
View on GitHub
a hyper-optimized single-node(local) version of spark sql engine, which's fundamental data structure is scala Iterator rather than RDD.
☆13Jun 13, 2023Updated 3 years ago
allwefantasy / pyjava
View on GitHub
This library is an ongoing effort towards bringing the data exchanging ability between Java/Scala and Python. PyJava introduces Apache A…
☆49Jun 15, 2026Updated last month
lancedb / tantivy-object-store
View on GitHub
Tantivy directory implementation backed by object_store
☆42Jan 22, 2024Updated 2 years ago
databendlabs / databend-go
View on GitHub
Golang driver for databend cloud
☆21Updated this week
neoremind / app-on-yarn-demo
View on GitHub
Demo for service oriented application hosted on Hadoop YARN cluster for HA and scheduling
☆23Apr 2, 2018Updated 8 years ago
scalapy / python-native-libs
View on GitHub
Helpers for setting up an embedded Python interpreter
☆20Oct 31, 2025Updated 8 months ago
CUBRID / cubrid-testcases
View on GitHub
☆10Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
jkelleyrtp / bumpslab
View on GitHub
A slab allocator with stable references
☆15Jan 23, 2023Updated 3 years ago
streamlytic / TDengine-Docker
View on GitHub
docker scripts to build and run a minimal version of TDengine
☆10Jul 17, 2019Updated 7 years ago
zjffdu / flink-udf
View on GitHub
☆12Mar 12, 2021Updated 5 years ago
amazon-archives / s3-inventory-usage-examples
View on GitHub
Examples demonstrating how to use Amazon S3 Inventory to analyze your S3 storage using Spark and EMR.
☆20Mar 4, 2020Updated 6 years ago
jacopotagliabue / pixel_from_lambda
View on GitHub
Serve a 1x1 GIF pixel from an AWS lambda-powered endpoint
☆13Sep 7, 2017Updated 8 years ago
ambition119 / QueryParse
View on GitHub
sql解析和执行，能够执行hive, spark, flink, 以及对应对TensorFlow, Deeplearning4j的算法SQL执行
☆11Sep 16, 2022Updated 3 years ago
Boostport / hbase-phoenix-all-in-one
View on GitHub
☆20Jul 17, 2023Updated 3 years ago
Delphi-Data / dbt_natural_language
View on GitHub
A dbt package to run natural language queries
☆10Jan 13, 2023Updated 3 years ago
RecList / reclist
View on GitHub
Behavioral "black-box" testing for recommender systems
☆475Aug 9, 2023Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
vescale / zgraph
View on GitHub
An embeddable graph database for large-scale vertices and edges
☆75Apr 16, 2023Updated 3 years ago
aws-samples / aws-cdk-deep-learning-image-vector-embeddings-at-scale-using-aws-batch
View on GitHub
AWS Blog post code for running feature-extraction on images using AWS Batch and Cloud Development Kit (CDK).
☆21Oct 28, 2022Updated 3 years ago
XpressAI / SparkCyclone
View on GitHub
Plugin to accelerate Spark SQL with the NEC Vector Engine.
☆19Aug 15, 2022Updated 3 years ago
leiysky / tpch-databend
View on GitHub
TPCH benchmark tool for databend
☆11Nov 15, 2022Updated 3 years ago
Tubitv / xdiff
View on GitHub
xreq and xdiff tool to call or diff complicated API easily
☆104Jul 18, 2025Updated last year
PingCAP-QE / wreck-it
View on GitHub
This repository has been archived. See https://github.com/chaos-mesh/go-sqlancer for the new version
☆12May 12, 2020Updated 6 years ago
silentsokolov / dbt-databend
View on GitHub
The Databend plugin for dbt (data build tool)
☆12Mar 17, 2023Updated 3 years ago