apache/incubator-liminal

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/apache/incubator-liminal)

apache / incubator-liminal

Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful experiment to an automated pipeline of model training, validation, deployment and inference in production. Liminal provides a Domain Specific Language to build ML workflows on top of Apache Airflow.

☆143

Alternatives and similar repositories for incubator-liminal

Users that are interested in incubator-liminal are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wooplevip / sedis
View on GitHub
SQL for Redis
☆11Sep 16, 2022Updated 3 years ago
dbis-ilm / piglet
View on GitHub
A compiler for Pig Latin to Spark and Flink.
☆24Nov 21, 2019Updated 6 years ago
apache / fineract-cn-customer
View on GitHub
Apache Fineract know your customer service
☆11Jan 6, 2023Updated 3 years ago
apache / incubator-datalab
View on GitHub
Apache DataLab (incubating)
☆152Oct 3, 2023Updated 2 years ago
ExpediaGroup / apiary
View on GitHub
Apiary provides modules which can be combined to create a federated cloud data lake
☆38Apr 3, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
vlsi / calcite-test-dataset
View on GitHub
Data sets and Vagrant script to provision a virtual machine for Apache Calcite development
☆30Mar 24, 2023Updated 3 years ago
xavierguihot / spark_helper
View on GitHub
A bunch of low-level basic methods for data processing and monitoring with Scala Spark
☆10Jun 29, 2018Updated 8 years ago
apache / creadur-rat
View on GitHub
Apache Creadur RAT - Release Audit Tool
☆40Updated this week
UrbanOS-Public / kdp
View on GitHub
Kubernetes deployment of PrestoDB, Hive Metastore, and Minio S3-standard object store
☆17Oct 20, 2022Updated 3 years ago
gordonmurray / cloudfloe
View on GitHub
The Switzerland of Iceberg queries: neutral, easy entry across S3, R2, MinIO
☆22Apr 15, 2026Updated 3 months ago
astronomer / astronomer-fab-securitymanager
View on GitHub
Security Manager for the Astronomer Airflow distribution
☆11Jun 25, 2024Updated 2 years ago
BIDData / BIDMach_Spark
View on GitHub
Code to allow running BIDMach on Spark including HDFS integration and lightweight sparse model updates (Kylix).
☆16Jul 23, 2020Updated 6 years ago
etsy / boundary-layer
View on GitHub
Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform
☆260Jul 19, 2023Updated 3 years ago
ottogroup / SPQR
View on GitHub
Spooker is a dynamic framework for processing high volume data streams via processing pipelines
☆30Feb 1, 2016Updated 10 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
datasphere-oss / datasphere-service
View on GitHub
an open source dataworks platform
☆20Jun 4, 2021Updated 5 years ago
Wikia / discreETLy
View on GitHub
ETLy is an add-on dashboard service on top of Apache Airflow.
☆69Jul 21, 2023Updated 3 years ago
jdye64 / docker-hwx
View on GitHub
Combination of Dockerized Hortonworks projects and other Hadoop ecosystem components
☆10Oct 11, 2019Updated 6 years ago
bigdatagenomics / utils
View on GitHub
General utility code used across BDG products. Apache 2 licensed.
☆18Mar 17, 2026Updated 4 months ago
uber / marmaray
View on GitHub
Generic Data Ingestion & Dispersal Library for Hadoop
☆483Mar 19, 2023Updated 3 years ago
brightcove-archive / ooyala_scamr
View on GitHub
A Hadoop map reduce framework for Scala.
☆15Apr 21, 2016Updated 10 years ago
combinator-ml / combinator
View on GitHub
Combinator.ml's central repo, documentation and website
☆30Jan 6, 2022Updated 4 years ago
CIOIL / DataGovIL
View on GitHub
CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use…
☆17Jan 20, 2022Updated 4 years ago
NetEase / spark-ranger
View on GitHub
ACL Management for Apache Spark SQL with Apache Ranger
☆17Jun 18, 2020Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
linkedin / transport
View on GitHub
A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…
☆306Jun 29, 2026Updated 3 weeks ago
treeverse / lakeview
View on GitHub
lakeview is a visibility tool for S3 based data lakes
☆30Mar 26, 2026Updated 3 months ago
Ctrip-DI / Hue-Ctrip-DI
View on GitHub
Ctrip Data Infrastructure team works for hue
☆16Dec 10, 2014Updated 11 years ago
jpplayer / hdfs-auto-snapshot
View on GitHub
HDFS Automatic Snapshot Service for Linux
☆11Oct 17, 2016Updated 9 years ago
youngwookim / awesome-presto
View on GitHub
A curated list of awesome PrestoDB / Trino software, libraries, tools and resources
☆18Jun 28, 2021Updated 5 years ago
amundsen-io / amundsen
View on GitHub
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting…
☆4,780Jul 1, 2026Updated 3 weeks ago
airbnb / sputnik
View on GitHub
☆64Nov 8, 2019Updated 6 years ago
apache / submarine
View on GitHub
Submarine is Cloud Native Machine Learning Platform.
☆706Apr 3, 2024Updated 2 years ago
SANSA-Stack / SANSA-DataLake
View on GitHub
A library to query heterogeneous data sources uniformly using SPARQL
☆12Dec 5, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
spilth / maven-book
View on GitHub
A book about Maven in the style of the Pragmatic Guides published by The Pragmatic Bookshelf
☆11Dec 12, 2015Updated 10 years ago
theory / pg-jsonschema-boon
View on GitHub
JSON Schema Validation in Postgres
☆26Jun 8, 2026Updated last month
xavient / CDS
View on GitHub
Content Data Store (HDFS/HBase)
☆13Dec 1, 2016Updated 9 years ago
tugul / CoreJava
View on GitHub
Konzepte von Core-Java 8 werden durch beispiele illustriert. Java 8's core concepts are explained by examples.
☆12Oct 12, 2018Updated 7 years ago
rbrush / kite-apps
View on GitHub
Prescriptive Applications over Kite and Hadoop
☆12Oct 14, 2015Updated 10 years ago
dbs-leipzig / gradoop
View on GitHub
Distributed Temporal Graph Analytics with Apache Flink
☆251Jan 11, 2026Updated 6 months ago
big-data-europe / docker-hdfs-filebrowser
View on GitHub
A docker image for HDFS FileBrowser. Cloudera Hue with FileBrowser only.
☆11Sep 20, 2018Updated 7 years ago