Pipeline definitions for managing data flows to power analytics at MIT Open Learning
☆46 · Mar 16, 2026 · Updated this week
Alternatives and similar repositories for ol-data-platform
Users that are interested in ol-data-platform are comparing it to the libraries listed below
- ☆31 · Mar 6, 2022 · Updated 4 years ago
- ☆11 · Jun 17, 2023 · Updated 2 years ago
- rust-for-data ☆52 · Jul 12, 2023 · Updated 2 years ago
- The DAMN (Data Assets Metric Navigation) tool extracts and reports metrics about your data assets ☆11 · Dec 27, 2024 · Updated last year
- A framework to manage data, continuously ☆33 · Jan 20, 2025 · Updated last year
- Demo Project for Open Source MDS ☆169 · Aug 27, 2025 · Updated 6 months ago
- Example Dagster Cloud code for the Hooli Data Engineering organization. ☆23 · Mar 5, 2026 · Updated 2 weeks ago
- A curated list of dagster code snippets for data engineers ☆56 · Feb 26, 2024 · Updated 2 years ago
- Interfacing Dagster and R ☆23 · Jun 26, 2024 · Updated last year
- Ibis analytics, with Ibis (and more!) ☆24 · Sep 24, 2024 · Updated last year
- ☆12 · Jan 10, 2023 · Updated 3 years ago
- duckdb-etl-framework ☆15 · Dec 20, 2024 · Updated last year
- A financial disclosure data extraction tool. ☆21 · Aug 2, 2023 · Updated 2 years ago
- Build a REST API on top of your data warehouse ☆42 · Oct 19, 2022 · Updated 3 years ago
- Department of Education (DOE) for New South Wales (AUS) data stack in a box ☆36 · Nov 13, 2024 · Updated last year
- Anomstack - Painless open source anomaly detection for your metrics 📈📉🚀 ☆109 · Feb 1, 2026 · Updated last month
- Kirill's miscellaneous functions ☆18 · Mar 13, 2026 · Updated last week
- Scrape and display Tyler Cowen's current favorite restaurants ☆11 · Mar 15, 2026 · Updated last week
- Update, Upsert, and Merge from Python dataframes to SQL Server and Azure SQL database. ☆12 · May 30, 2024 · Updated last year
- 🌸 A tool to pick, bake release branches, and mechanize label-driven development ☆14 · Mar 14, 2026 · Updated last week
- QLLM: A powerful CLI for seamless interaction with multiple Large Language Models. Simplify AI workflows, streamline development, and unl… ☆35 · Apr 11, 2025 · Updated 11 months ago
- ☆20 · Jun 28, 2023 · Updated 2 years ago
- ☆22 · Jul 21, 2023 · Updated 2 years ago
- Sling is a CLI tool that extracts data from a source storage/database and loads it in a target storage/database. ☆818 · Updated this week
- Scrapy exporter for Big Data formats ☆16 · Mar 10, 2026 · Updated last week
- A tool for capturing snapshots of public data sources and archiving them on Zenodo for programmatic use. ☆14 · Updated this week
- Access the Congress.gov API ☆19 · Sep 1, 2025 · Updated 6 months ago
- ☆13 · Mar 4, 2026 · Updated 2 weeks ago
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples ☆124 · Feb 5, 2025 · Updated last year
- Data-aware orchestration with dagster, dbt, and airbyte ☆31 · Jan 20, 2023 · Updated 3 years ago
- ☆17 · May 3, 2024 · Updated last year
- Dagster Labs' open-source data platform, built with Dagster. ☆447 · Updated this week
- ☆11 · Oct 8, 2021 · Updated 4 years ago
- A command-line tool that summarizes the size of a codebase by language, showing lines of code with and without comments and blank lines. ☆50 · Mar 6, 2026 · Updated 2 weeks ago
- Fulfills a GitHub workflow_job webhooks into a Pub/Sub queue. ☆12 · Mar 13, 2025 · Updated last year
- Add data profiling to your dbt project. ☆96 · Feb 26, 2026 · Updated 3 weeks ago
- Scratchpad for scraper development and general utilities. ☆27 · Aug 11, 2024 · Updated last year
- Codebase used to build NREL's National Thermal Generator Performance Database ☆15 · Mar 17, 2021 · Updated 5 years ago
- PV value mapping: temporal output shaping ☆11 · Oct 11, 2019 · Updated 6 years ago