Distribution transparent Machine Learning experiments on Apache Spark
☆91Feb 21, 2024Updated 2 years ago
Alternatives and similar repositories for maggy
Users that are interested in maggy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Utility Library for Hopsworks. Issues can be posted at https://community.hopsworks.ai☆27Feb 3, 2026Updated last month
- Python - Java/Scala API for the Hopsworks feature store☆55Sep 24, 2025Updated 6 months ago
- Point-in-Time optimizations for Apache Spark☆30Jan 18, 2024Updated 2 years ago
- Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops☆117Jan 28, 2026Updated last month
- ☆12Apr 10, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Hopsworks - Data-Intensive AI platform with a Feature Store☆1,289Feb 10, 2025Updated last year
- Hops Hadoop is a distribution of Apache Hadoop with distributed metadata.☆322Jan 22, 2026Updated 2 months ago
- Reproducing Distributed Systems and Experiments on Cloud☆40Sep 11, 2023Updated 2 years ago
- Accompanying solution accelerator notebook for the Databricks blog on parallel training and inference☆15Jul 12, 2022Updated 3 years ago
- ☆17May 7, 2024Updated last year
- HopsYARN Tensorflow Framework.☆31Oct 22, 2019Updated 6 years ago
- Clickstream Faker Provider for Python.☆11Apr 2, 2022Updated 3 years ago
- HopsWorks - Hadoop for Humans☆117Apr 25, 2019Updated 6 years ago
- 📚 Python Color Conversion Lib☆16Oct 10, 2017Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- MinPlotX is THE! tool for mineral formula recalculation and compositional plotting.☆14Jul 30, 2025Updated 7 months ago
- Package for apatite-based thermodynamic models: ApThermo (melt hygrometry) and ApREE (REE partitioning).☆11Jul 18, 2024Updated last year
- The Flyte data-sidecar that helps move the input and output data intelligently between containers☆10Oct 9, 2023Updated 2 years ago
- Mesos Integration Tests on Docker/Ec2☆15May 25, 2023Updated 2 years ago
- Thermodynamic calculations and diagrams for geochemistry☆10Mar 12, 2026Updated 2 weeks ago
- Kompics - A message-passing component model for building distributed systems☆66Oct 4, 2022Updated 3 years ago
- Spark implementation of computing Shapley Values using monte-carlo approximation☆80Mar 20, 2023Updated 3 years ago
- Canonical repository https://git.dbogatov.org/bu/ore-benchmark/Project-Code☆20Dec 8, 2022Updated 3 years ago
- A library on top of either pex or conda-pack to make your Python code easily available on a cluster☆46Feb 4, 2026Updated last month
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Scalytics Connect development environment, pre-build☆22Feb 21, 2024Updated 2 years ago
- A model of sulfur degassing during magma ascent☆15Aug 30, 2025Updated 6 months ago
- Real-time data processing/feature engineering tailored for modern AI/ML systems.☆116Mar 18, 2026Updated last week
- This is RonDB, a distribution of NDB Cluster developed and used by Hopsworks AB. It also contains development branches of RonDB.☆705Updated this week
- ☆10Oct 1, 2020Updated 5 years ago
- Youtube crawler to measure end-to-end video reception quality☆25Oct 6, 2019Updated 6 years ago
- A spec for reporting errors in data quality.☆20May 25, 2021Updated 4 years ago
- An HPC Interface for data analysis platforms☆23Mar 14, 2020Updated 6 years ago
- Python API for Deequ☆41Nov 10, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- simple set of functions and cli for image manipulation☆12Feb 4, 2017Updated 9 years ago
- ForestFlow is a policy-driven Machine Learning Model Server. It is an LF AI Foundation incubation project.☆73Feb 21, 2024Updated 2 years ago
- Jupyter extensions for SWAN☆60Updated this week
- Dependency and data pipeline management framework for Spark and Scala☆15Apr 8, 2017Updated 8 years ago
- A curated list of scientific figures in research papers.☆28Oct 29, 2025Updated 4 months ago
- European Parliament Open Data // Twitter☆20Sep 23, 2022Updated 3 years ago
- A simple Spark-powered ETL framework that just works 🍺☆184Oct 2, 2025Updated 5 months ago