pachyderm / pachyderm
Data-Centric Pipelines and Data Versioning
β6,211Updated last month
Alternatives and similar repositories for pachyderm:
Users that are interested in pachyderm are comparing it to the libraries listed below
- Machine Learning Toolkit for Kubernetesβ14,768Updated this week
- π Parameterize, execute, and analyze notebooksβ6,113Updated 2 months ago
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning modelsβ4,488Updated this week
- Build, Deploy and Manage AI/ML Systemsβ8,644Updated this week
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visβ¦β18,175Updated last month
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycleβ3,613Updated 2 weeks ago
- High-Performance Serverless event and data processing platformβ5,401Updated this week
- Quilt is a data mesh for connecting people with actionable dataβ1,331Updated this week
- A curated list of awesome pipeline toolkits inspired by Awesome Sysadminβ6,309Updated last week
- The Open Source Feature Store for AI/MLβ5,874Updated this week
- PipelineAIβ4,171Updated 11 months ago
- Production infrastructure for machine learning at scaleβ8,030Updated 9 months ago
- π¦ Data Versioning and ML Experimentsβ14,282Updated 2 weeks ago
- Parallel computing with task schedulingβ13,045Updated this week
- NumPy and Pandas interface to Big Dataβ3,196Updated last year
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per sβ¦β8,356Updated 5 months ago
- Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.β6,109Updated this week
- Open Source ML Model Versioning, Metadata, and Experiment Managementβ1,719Updated 7 months ago
- An open-source graph databaseβ14,902Updated last week
- βΎοΈ CML - Continuous Machine Learning | CI/CD for MLβ4,075Updated this week
- A GPU-powered real-time analytics storage and query engine.β3,048Updated 8 months ago
- A next-generation curated knowledge sharing platform for data scientists and other technical professions.β5,508Updated 6 months ago
- Workflow Engine for Kubernetesβ15,457Updated this week
- Always know what to expect from your data.β10,273Updated this week
- Cadence is a distributed, scalable, durable, and highly available orchestration engine to execute asynchronous long-running business logiβ¦β8,543Updated this week
- the portable Python dataframe libraryβ5,608Updated this week
- Agile Data Preparation Workflows madeΒ easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySparkβ1,497Updated 3 months ago
- M3 monorepo - Distributed TSDB, Aggregator and Query Engine, Prometheus Sidecar, Graphite Compatible, Metrics Platformβ4,808Updated this week
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analyticsβ15,129Updated this week
- Beaker Extensions for Jupyter Notebookβ2,809Updated last year