apache / incubator-marvinLinks
Apache Marvin-AI
☆99Updated 2 years ago
Alternatives and similar repositories for incubator-marvin
Users that are interested in incubator-marvin are comparing it to the libraries listed below
Sorting:
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated 2 years ago
- MLOps Platform☆272Updated last year
- Tool to automate data quality checks on data pipelines☆256Updated 3 years ago
- Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful …☆144Updated last year
- ☆108Updated 3 years ago
- Common library for serving TensorFlow, XGBoost and scikit-learn models in production.☆143Updated 2 years ago
- HopsWorks - Hadoop for Humans☆117Updated 6 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆127Updated 4 years ago
- Joblib Apache Spark Backend☆249Updated 10 months ago
- Egeria's Guidance on Governance as well as large media files such as presentations and movies☆107Updated 3 years ago
- MLflow App Library☆77Updated 7 years ago
- HandySpark - bringing pandas-like capabilities to Spark dataframes☆197Updated 6 years ago
- Jupyter kernel for scala and spark☆190Updated 2 years ago
- Privacy transformations on Spark and Pandas dataframes backed by a simple policy language.☆176Updated 2 years ago
- Python - Java/Scala API for the Hopsworks feature store☆55Updated 4 months ago
- Implementations of the Portable Format for Analytics (PFA)☆126Updated 3 years ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆72Updated 5 years ago
- A toolkit providing a uniform interface for connecting to and extracting data from a wide variety of (potentially remote) data stores (in…☆257Updated 2 months ago
- Apache DataLab (incubating)☆152Updated 2 years ago
- DBND is an agile pipeline framework that helps data engineering teams track and orchestrate their data processes.☆268Updated 10 months ago
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)☆141Updated 2 years ago
- Distribution transparent Machine Learning experiments on Apache Spark☆91Updated last year
- Spark ML Lib serving library☆48Updated 7 years ago
- A library for exporting Spark ML models and pipelines to PFA☆54Updated 7 years ago
- Create HTML profiling reports from Apache Spark DataFrames☆197Updated 6 years ago
- Apache (Py)Spark type annotations (stub files).☆118Updated 3 years ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆97Updated this week
- Morpheus brings the leading graph query language, Cypher, onto the leading distributed processing platform, Spark.☆346Updated 3 weeks ago
- Asynchronous actions for PySpark☆48Updated 4 years ago
- Data ingestion library for Amundsen to build graph and search index☆204Updated last year