pachyderm / pachyderm
Data-Centric Pipelines and Data Versioning
☆6,197 Updated this week
Alternatives and similar repositories for pachyderm:
Users who are interested in pachyderm are comparing it to the libraries listed below.
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle ☆3,591 Updated this week
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis… ☆18,024 Updated this week
- High-Performance Serverless event and data processing platform ☆5,352 Updated this week
- Machine Learning Toolkit for Kubernetes ☆14,544 Updated last month
- Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks. ☆5,924 Updated this week
- Open Source AI/ML Platform ☆8,446 Updated this week
- A next-generation curated knowledge sharing platform for data scientists and other technical professions. ☆5,499 Updated 4 months ago
- A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin ☆6,241 Updated last month
- An orchestration platform for the development, production, and observation of data assets. ☆12,300 Updated this week
- An open-source graph database ☆14,866 Updated 3 weeks ago
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models ☆4,430 Updated this week
- A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Doc… ☆2,527 Updated 10 months ago
- The Open Source Feature Store for Machine Learning ☆5,722 Updated this week
- Always know what to expect from your data. ☆10,117 Updated this week
- Quilt is a data mesh for connecting people with actionable data ☆1,329 Updated this week
- A curated list of awesome ETL frameworks, libraries, and software. ☆3,319 Updated 5 months ago
- Python Stream Processing ☆6,757 Updated 5 months ago
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and… ☆10,112 Updated this week
- M3 monorepo - Distributed TSDB, Aggregator and Query Engine, Prometheus Sidecar, Graphite Compatible, Metrics Platform ☆4,789 Updated 3 weeks ago
- 🦉 Data Versioning and ML Experiments ☆14,088 Updated this week
- Beaker Extensions for Jupyter Notebook ☆2,804 Updated last year
- A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow ☆2,081 Updated last year
- 📚 Parameterize, execute, and analyze notebooks ☆6,047 Updated last week
- A GPU-powered real-time analytics storage and query engine. ☆3,042 Updated 6 months ago
- BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF. ☆1,941 Updated 2 years ago
- The leader in Next-Generation Customer Data Infrastructure ☆6,868 Updated 4 months ago
- Visualizations for machine learning datasets ☆7,357 Updated last year
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows ☆38,298 Updated this week
- The Universal Storage Engine ☆1,887 Updated this week
- P2P Docker registry capable of distributing TBs of data in seconds ☆6,168 Updated this week