apache / incubator-marvin
Apache Marvin-AI
☆101 Updated last year
Alternatives and similar repositories for incubator-marvin:
Users who are interested in incubator-marvin are comparing it to the libraries listed below.
- A tool for building feature stores. ☆293 Updated this week
- ML made simple ☆210 Updated last year
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code. ☆123 Updated 3 years ago
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex) ☆140 Updated last year
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python) ☆74 Updated last year
- Apache Liminal's goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful … ☆144 Updated 7 months ago
- Data ingestion library for Amundsen to build graph and search index ☆205 Updated 11 months ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples ☆70 Updated 4 years ago
- An open-source tool for quick Time Series Analysis and Forecasting ☆104 Updated last year
- 1st paper that introduces main Marvin features and concepts, more like a white-paper ☆25 Updated 6 years ago
- MLOps Platform ☆271 Updated 3 months ago
- Marvin AI has been accepted into the Apache Foundation and is now available at https://github.com/apache/incubator-marvin ☆140 Updated 6 years ago
- MLflow App Library ☆77 Updated 6 years ago
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and… ☆28 Updated last year
- Tools for faster and optimized interaction with Teradata and large datasets. ☆17 Updated 6 years ago
- Spark ML Lib serving library ☆48 Updated 6 years ago
- Python - Java/Scala API for the Hopsworks feature store ☆54 Updated this week
- The LinkedIn Fairness Toolkit (LiFT) is a Scala/Spark library that enables the measurement of fairness in large scale machine learning wo… ☆169 Updated last year
- Resources for Data Science Process management ☆204 Updated 5 years ago
- Create HTML profiling reports from Apache Spark DataFrames ☆195 Updated 5 years ago
- DataQuality for BigData ☆143 Updated last year
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi… ☆94 Updated this week
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an… ☆61 Updated 5 months ago
- Asynchronous actions for PySpark ☆47 Updated 3 years ago
- Marvin AI has been accepted into the Apache Foundation and is now available at https://github.com/apache/incubator-marvin ☆45 Updated 6 years ago
- ☆33 Updated 10 years ago
- Dremio Flight connector. Access Dremio using Arrow flight ☆40 Updated 4 years ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes ☆63 Updated 2 years ago
- Privacy transformations on Spark and Pandas dataframes backed by a simple policy language. ☆174 Updated last year
- HandySpark - bringing pandas-like capabilities to Spark dataframes ☆192 Updated 5 years ago