Pipeline definitions for managing data flows to power analytics at MIT Open Learning
☆47Apr 4, 2026Updated last week
Alternatives and similar repositories for ol-data-platform
Users that are interested in ol-data-platform are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Jun 17, 2023Updated 2 years ago
- rust-for-data☆53Jul 12, 2023Updated 2 years ago
- A framework to manage data, continuously☆34Jan 20, 2025Updated last year
- Demo Project for Open Source MDS☆169Aug 27, 2025Updated 7 months ago
- Example Dagster Cloud code for the Hooli Data Engineering organization.☆25Apr 3, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A curated list of dagster code snippets for data engineers☆56Feb 26, 2024Updated 2 years ago
- Interfacing Dagster and R☆23Jun 26, 2024Updated last year
- ☆12Jan 10, 2023Updated 3 years ago
- Configuration and schema sync for Metabase from Python☆19Mar 23, 2023Updated 3 years ago
- Ibis analytics, with Ibis (and more!)☆24Sep 24, 2024Updated last year
- A financial disclosure data extraction tool.☆21Aug 2, 2023Updated 2 years ago
- Runnable e-commerce mini data warehouse based on Python, PostgreSQL & Metabase, template for new projects☆29Mar 31, 2021Updated 5 years ago
- Department of Education (DOE) for New South Wales (AUS) data stack in a box☆35Nov 13, 2024Updated last year
- Anomstack - Painless open source anomaly detection for your metrics 📈📉🚀☆108Feb 1, 2026Updated 2 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Kirill's miscellaneous functions☆18Mar 13, 2026Updated 3 weeks ago
- Mockup data generator library.☆11Mar 24, 2025Updated last year
- Template Dagster repo using poetry and a single Docker container; works well with CICD☆68Apr 1, 2022Updated 4 years ago
- QLLM: A powerful CLI for seamless interaction with multiple Large Language Models. Simplify AI workflows, streamline development, and unl…☆35Apr 11, 2025Updated last year
- ☆20Jun 28, 2023Updated 2 years ago
- A Streamlit component for rendering leaflet maps☆18Oct 11, 2022Updated 3 years ago
- ☆22Jul 21, 2023Updated 2 years ago
- Dockerized MeCab API Server☆10Feb 5, 2019Updated 7 years ago
- A SQLite adapter plugin for dbt (data build tool)☆83Jun 28, 2025Updated 9 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Python wrapper for the Sling CLI tool☆68Feb 26, 2026Updated last month
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples☆124Feb 5, 2025Updated last year
- Data-aware orchestration with dagster, dbt, and airbyte☆31Jan 20, 2023Updated 3 years ago
- Dagster Labs' open-source data platform, built with Dagster.☆453Updated this week
- A PDM plugin to sync the exported files with the project file☆15Sep 6, 2025Updated 7 months ago
- Airflow operators, hooks, and sensors for interacting with the Hightouch API☆17Mar 11, 2026Updated 3 weeks ago
- A command-line tool that summarizes the size of a codebase by language, showing lines of code with and without comments and blank lines.☆56Mar 6, 2026Updated last month
- Fulfills a GitHub workflow_job webhooks into a Pub/Sub queue.☆12Mar 13, 2025Updated last year
- Add data profiling to your dbt project.☆98Feb 26, 2026Updated last month
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Scratchpad for scraper development and general utilities.☆27Aug 11, 2024Updated last year
- Codebase used to build NREL's National Thermal Generator Performance Database☆15Mar 17, 2021Updated 5 years ago
- PV value mapping: temporal output shaping☆11Oct 11, 2019Updated 6 years ago
- This repository shows how to use CDK Pipelines to create a cross-account CI/CD pipeline for Amazon Elastic Container Service (ECS)☆12Jul 25, 2023Updated 2 years ago
- Progress Bar add-on for Anki☆13Mar 24, 2021Updated 5 years ago
- The best Python package for comparing two dataframes☆11Dec 29, 2021Updated 4 years ago
- Fast PostgreSQL bulk inserts with Cython and binary copy☆12Jun 1, 2020Updated 5 years ago