A simple and easy to use Data Quality (DQ) tool built with Python.
☆51Sep 7, 2023Updated 2 years ago
Alternatives and similar repositories for tinytimmy
Users that are interested in tinytimmy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- how to unit test your PySpark code☆29Mar 26, 2021Updated 5 years ago
- Materialize plugin for dbt☆12Jan 25, 2021Updated 5 years ago
- Example orchestration pipeline for Fivetran + dbt managed by Airflow☆22Feb 18, 2021Updated 5 years ago
- Delta Lake helper methods. No Spark dependency.☆22Jan 19, 2026Updated 5 months ago
- A toolkit for managing data access policies as code☆12Apr 18, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A Rust based data/CSV/Parquet file generator☆66Mar 3, 2025Updated last year
- This checklist aims to be an exhaustive list of all elements you should consider when using Amazon Redshift.☆15Sep 21, 2020Updated 5 years ago
- A compilation of main commands for scikit-learn with examples☆11Apr 4, 2023Updated 3 years ago
- Template for Data Engineering and Data Pipeline projects☆120Jan 1, 2023Updated 3 years ago
- ☆14Apr 19, 2023Updated 3 years ago
- This is a typing game made with Phaser 3 with my son when he was 8 years old!☆25Feb 22, 2023Updated 3 years ago
- Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those plu…☆61Mar 29, 2023Updated 3 years ago
- Stores Snowplow enriched events in Redshift, Snowflake and Databricks☆30Jun 16, 2026Updated last week
- ☆160Feb 25, 2026Updated 4 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆27Sep 6, 2024Updated last year
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆227Jun 8, 2026Updated 3 weeks ago
- Analyzing the most strategic words to guess on Wordle, based on letter frequency distributions☆11Feb 20, 2022Updated 4 years ago
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆125Mar 31, 2025Updated last year
- ☆33Apr 23, 2026Updated 2 months ago
- Schema modelling framework for decentralised domain-driven ownership of data.☆261Dec 5, 2023Updated 2 years ago
- Delta Lake helper methods in PySpark☆329Jan 19, 2026Updated 5 months ago
- Interactive Elasticsearch Analyzer☆13Dec 8, 2022Updated 3 years ago
- A tutorial for using Hadoop with Python and Hive☆10May 26, 2015Updated 11 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- EpochFS is a versioned cloud file system with git-like branching, transaction support.☆17Apr 23, 2026Updated 2 months ago
- Deployed an kafka instance in AWS EC2 Instance to streamline the data into Databricks☆10Aug 15, 2023Updated 2 years ago
- Tutorial for PyData London 2019 on AB Test by cluster☆13Jul 12, 2019Updated 6 years ago
- ☆17Nov 25, 2024Updated last year
- A script that gets data from the Twitter real-time API, passes it to a message-queue (e.g. RabbitMQ) and stores tweets into MongoDB☆11Apr 20, 2017Updated 9 years ago
- Ready to use Spanish Word2Vec embeddings created from >18B chars and >3B words☆44Jun 22, 2019Updated 7 years ago
- Recursive partitioning (tree models) of psychometric networks☆13Sep 5, 2022Updated 3 years ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆225Apr 30, 2025Updated last year
- An ORM-Like interface for Google Cloud NoSQL Datastore☆13May 8, 2021Updated 5 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Tidymodels for Nested/Panel Data☆13Sep 30, 2023Updated 2 years ago
- A dbt package for modelling dbt metadata. https://brooklyn-data.github.io/dbt_artifacts☆396Jun 8, 2026Updated 3 weeks ago
- 🏃♀️ Minimalist SQL orchestrator☆327Jun 23, 2026Updated last week
- Data Observability for Data Engineering, published by Packt Publishing☆11Jan 24, 2025Updated last year
- ☆81Oct 14, 2024Updated last year
- ☆20Apr 17, 2025Updated last year
- + Provides a GitHub repository template for a configuration factory and rule set factories for friendsofphp/php-cs-fixer.☆12Jun 21, 2026Updated last week