apache / incubator-marvin
Apache Marvin-AI
☆102Updated last year
Related projects: ⓘ
- A tool for building feature stores.☆281Updated this week
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆122Updated 3 years ago
- ☆22Updated this week
- ☆115Updated this week
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated 10 months ago
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)☆139Updated last year
- Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful …☆142Updated 2 months ago
- Tool to automate data quality checks on data pipelines☆246Updated 2 years ago
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and…☆28Updated last year
- Google BigQuery support for Spark, SQL, and DataFrames☆155Updated 4 years ago
- A library for exporting Spark ML models and pipelines to PFA☆54Updated 5 years ago
- Data ingestion library for Amundsen to build graph and search index☆206Updated 6 months ago
- A frictionless integrated platform for notebook☆85Updated last year
- Python - Java/Scala API for the Hopsworks feature store☆53Updated last week
- ☆37Updated 5 years ago
- Common library for serving TensorFlow, XGBoost and scikit-learn models in production.☆139Updated last year
- ☆30Updated 2 years ago
- Joblib Apache Spark Backend☆242Updated last month
- Dremio Metabase driver☆17Updated 4 years ago
- Implementations of the Portable Format for Analytics (PFA)☆129Updated last year
- Resources for Data Science Process management☆206Updated 4 years ago
- Scala Aggregators used for ML Model metrics monitoring☆91Updated last year
- Open source platform for the machine learning lifecycle☆95Updated 5 years ago
- real-time data + ML pipeline☆54Updated this week
- Asynchronous actions for PySpark☆44Updated 2 years ago
- ☆107Updated this week
- HopsWorks - Hadoop for Humans☆116Updated 5 years ago
- Dremio SDK for JavaScript☆27Updated 4 years ago
- A Spark/Scala implementation of the isolation forest unsupervised outlier detection algorithm with support for exporting in ONNX format.☆224Updated 2 weeks ago
- Apache (Py)Spark type annotations (stub files).☆115Updated 2 years ago