apache / incubator-marvinLinks
Apache Marvin-AI
☆100Updated 2 years ago
Alternatives and similar repositories for incubator-marvin
Users that are interested in incubator-marvin are comparing it to the libraries listed below
Sorting:
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆126Updated 3 years ago
- Egeria's Guidance on Governance as well as large media files such as presentations and movies☆104Updated 2 years ago
- MLflow App Library☆80Updated 6 years ago
- MLOps Platform☆273Updated 8 months ago
- Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful …☆143Updated 11 months ago
- A toolkit providing a uniform interface for connecting to and extracting data from a wide variety of (potentially remote) data stores (in…☆254Updated last year
- A tool for building feature stores.☆308Updated 2 weeks ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆70Updated 5 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- Tool to automate data quality checks on data pipelines☆254Updated 2 years ago
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)☆141Updated 2 years ago
- ☆106Updated 2 years ago
- Python - Java/Scala API for the Hopsworks feature store☆54Updated last week
- Common library for serving TensorFlow, XGBoost and scikit-learn models in production.☆139Updated last year
- ☆33Updated 10 years ago
- HopsWorks - Hadoop for Humans☆117Updated 6 years ago
- 📚 Notebook storage and publishing workflows for the masses☆202Updated 3 years ago
- Apache DataLab (incubating)☆153Updated last year
- Distribution transparent Machine Learning experiments on Apache Spark☆91Updated last year
- Spark ML Lib serving library☆48Updated 7 years ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆96Updated this week
- Create HTML profiling reports from Apache Spark DataFrames☆196Updated 5 years ago
- HandySpark - bringing pandas-like capabilities to Spark dataframes☆196Updated 6 years ago
- ☆35Updated 3 months ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆61Updated 10 months ago
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs …☆158Updated 2 years ago
- Data ingestion library for Amundsen to build graph and search index☆205Updated last year
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆69Updated 4 months ago
- Joblib Apache Spark Backend☆249Updated 3 months ago
- Avro2TF is designed to fill the gap of making users' training data ready to be consumed by deep learning training frameworks.☆128Updated 5 years ago