A simple and easy to use Data Quality (DQ) tool built with Python.
☆51Sep 7, 2023Updated 2 years ago
Alternatives and similar repositories for tinytimmy
Users that are interested in tinytimmy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- how to unit test your PySpark code☆29Mar 26, 2021Updated 5 years ago
- Materialize plugin for dbt☆12Jan 25, 2021Updated 5 years ago
- Example orchestration pipeline for Fivetran + dbt managed by Airflow☆22Feb 18, 2021Updated 5 years ago
- Glue VSCode devcontainer setup☆14Jan 31, 2023Updated 3 years ago
- Delta Lake helper methods. No Spark dependency.☆22Jan 19, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- A Rust based data/CSV/Parquet file generator☆66Mar 3, 2025Updated last year
- This checklist aims to be an exhaustive list of all elements you should consider when using Amazon Redshift.☆15Sep 21, 2020Updated 5 years ago
- Template for Data Engineering and Data Pipeline projects☆117Jan 1, 2023Updated 3 years ago
- Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those plu…☆59Mar 29, 2023Updated 3 years ago
- Stores Snowplow enriched events in Redshift, Snowflake and Databricks☆30Apr 4, 2025Updated last year
- ☆158Feb 25, 2026Updated last month
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆227Mar 30, 2026Updated last week
- Catalog of datasets related underserved communities. An intermediate between community organizers and data scientists.☆47Jan 31, 2018Updated 8 years ago
- Analyzing the most strategic words to guess on Wordle, based on letter frequency distributions☆11Feb 20, 2022Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆125Mar 31, 2025Updated last year
- ☆10Mar 8, 2022Updated 4 years ago
- ☆32Jan 13, 2026Updated 2 months ago
- ☆10Jan 28, 2025Updated last year
- ☆33Apr 16, 2024Updated last year
- ☆13Jul 8, 2024Updated last year
- ☆66Jan 20, 2026Updated 2 months ago
- Compare DuckDB, Polars and Pandas for generating an artificial dataset of persons and companies☆35Aug 31, 2023Updated 2 years ago
- Delta Lake helper methods in PySpark☆328Jan 19, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Interactive Elasticsearch Analyzer☆13Dec 8, 2022Updated 3 years ago
- Yet Another (Spark) ETL Framework☆21Oct 21, 2023Updated 2 years ago
- Project for "Data pipeline design patterns" blog.☆51Aug 6, 2024Updated last year
- EpochFS is a versioned cloud file system with git-like branching, transaction support.☆17Mar 11, 2026Updated 3 weeks ago
- The code from the whylogs workshop in DataTalks.Club on 29 March 2022☆13Mar 29, 2022Updated 4 years ago
- Companion repository to the ETL & ELT Pipelines with Apache Airflow® eBook☆40Feb 16, 2026Updated last month
- Optimal Data Engine (ODE) for MSSQL☆14Dec 18, 2018Updated 7 years ago
- A script that gets data from the Twitter real-time API, passes it to a message-queue (e.g. RabbitMQ) and stores tweets into MongoDB☆11Apr 20, 2017Updated 8 years ago
- Auto-generated Diagrams from Airflow DAGs. 🔮 🪄☆358Mar 30, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Recursive partitioning (tree models) of psychometric networks☆13Sep 5, 2022Updated 3 years ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆225Apr 30, 2025Updated 11 months ago
- An ORM-Like interface for Google Cloud NoSQL Datastore☆13May 8, 2021Updated 4 years ago
- Tidymodels for Nested/Panel Data☆13Sep 30, 2023Updated 2 years ago
- Faye Ellis, Hands-on AWS Troubleshooting (1127)☆10Jul 12, 2023Updated 2 years ago
- 🏃♀️ Minimalist SQL orchestrator☆316Updated this week
- Data Observability for Data Engineering, published by Packt Publishing☆11Jan 24, 2025Updated last year