apache / incubator-marvinLinks
Apache Marvin-AI
☆100Updated 2 years ago
Alternatives and similar repositories for incubator-marvin
Users that are interested in incubator-marvin are comparing it to the libraries listed below
Sorting:
- A tool for building feature stores.☆305Updated 3 weeks ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆126Updated 3 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)☆141Updated last year
- Apache DataLab (incubating)☆153Updated last year
- MLflow App Library☆78Updated 6 years ago
- ML made simple☆209Updated 2 years ago
- Azkaban Auror core for flow creation☆10Updated 4 years ago
- ☆106Updated 2 years ago
- ☆37Updated 6 years ago
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Updated 6 years ago
- HopsWorks - Hadoop for Humans☆117Updated 6 years ago
- Common library for serving TensorFlow, XGBoost and scikit-learn models in production.☆139Updated last year
- Asynchronous actions for PySpark☆47Updated 3 years ago
- ☆111Updated 6 months ago
- Tool to automate data quality checks on data pipelines☆255Updated 2 years ago
- Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.☆70Updated 2 years ago
- Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful …☆143Updated 11 months ago
- An open-source tool for quick Time Series Analysis and Forecasting☆105Updated 2 years ago
- A library for exporting Spark ML models and pipelines to PFA☆54Updated 6 years ago
- Python - Java/Scala API for the Hopsworks feature store☆54Updated 3 weeks ago
- Spark package for checking data quality☆221Updated 5 years ago
- real-time data + ML pipeline☆54Updated this week
- Apache (Py)Spark type annotations (stub files).☆117Updated 2 years ago
- ☆33Updated 10 years ago
- Jupyter kernel for scala and spark☆189Updated last year
- Google BigQuery support for Spark, SQL, and DataFrames☆155Updated 5 years ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆95Updated last week
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆61Updated 9 months ago
- Splittable SAS (.sas7bdat) Input Format for Hadoop and Spark SQL☆91Updated last year