A simple and easy to use Data Quality (DQ) tool built with Python.
☆51Sep 7, 2023Updated 2 years ago
Alternatives and similar repositories for tinytimmy
Users that are interested in tinytimmy are comparing it to the libraries listed below
Sorting:
- Python test runner built in Rust☆19Feb 20, 2026Updated last week
- A compilation of main commands for scikit-learn with examples☆11Apr 4, 2023Updated 2 years ago
- Trying out the Dataframe Polars library with Delta Lake ... feat Python.☆12Jan 29, 2025Updated last year
- how to unit test your PySpark code☆29Mar 26, 2021Updated 4 years ago
- Copy structure of your Postgres DBs as Markdown to prompt LLMs better!☆14Feb 23, 2025Updated last year
- Materialize plugin for dbt☆12Jan 25, 2021Updated 5 years ago
- Glue VSCode devcontainer setup☆14Jan 31, 2023Updated 3 years ago
- A toolkit for managing data access policies as code☆13Apr 18, 2024Updated last year
- ☆29Sep 6, 2024Updated last year
- Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those plu…☆59Mar 29, 2023Updated 2 years ago
- Sentiment and language detection for text analytics.☆17Jul 3, 2024Updated last year
- ☆156Feb 6, 2026Updated 3 weeks ago
- Companion repository to the ETL & ELT Pipelines with Apache Airflow® eBook☆38Feb 16, 2026Updated last week
- Delta Lake helper methods. No Spark dependency.☆22Jan 19, 2026Updated last month
- A Rust based data/CSV/Parquet file generator☆65Mar 3, 2025Updated 11 months ago
- Template for Data Engineering and Data Pipeline projects☆116Jan 1, 2023Updated 3 years ago
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆124Mar 31, 2025Updated 11 months ago
- Schema modelling framework for decentralised domain-driven ownership of data.☆261Dec 5, 2023Updated 2 years ago
- 🏃♀️ Minimalist SQL orchestrator☆306Feb 17, 2026Updated last week
- ☆33Apr 16, 2024Updated last year
- Trino (f.k.a PrestoSQL) dialect for SQLAlchemy.☆25May 5, 2022Updated 3 years ago
- ☆31Jan 13, 2026Updated last month
- A dbt package for modelling dbt metadata. https://brooklyn-data.github.io/dbt_artifacts☆388Feb 2, 2026Updated 3 weeks ago
- ☆10May 25, 2021Updated 4 years ago
- benchmarks for LLM tokenizers☆17Jan 15, 2026Updated last month
- Python library for working with ThoughtSpot Modeling Language (TML) files programmatically☆10Oct 10, 2025Updated 4 months ago
- Docker Airflow - Contains a docker compose file for Airflow 2.0☆70Aug 23, 2022Updated 3 years ago
- Stores Snowplow enriched events in Redshift, Snowflake and Databricks☆30Apr 4, 2025Updated 10 months ago
- This is a custom project for WGU, the original project repo is https://github.com/udacity/nd0821-c2-build-model-workflow-starter☆12Feb 1, 2026Updated 3 weeks ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆228Feb 11, 2026Updated 2 weeks ago
- Running ClickHouse like it's BigQuery☆39Aug 24, 2023Updated 2 years ago
- A repository with some basic scripts to set up coreos hypev clusters.☆37Aug 9, 2019Updated 6 years ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆224Apr 30, 2025Updated 10 months ago
- Compare DuckDB, Polars and Pandas for generating an artificial dataset of persons and companies☆35Aug 31, 2023Updated 2 years ago
- A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local …☆13Dec 8, 2025Updated 2 months ago
- Deploy A/B testing infrastructure in a containerized microservice architecture for Machine Learning applications.☆40Jan 10, 2025Updated last year
- Open-source hub for cleaned, annotated, and well-documented financial datasets. Contributors can add new data, notebooks, and visualizati…☆18Sep 1, 2025Updated 6 months ago
- Sample project to get started with dbt-power-user vscode extension using dev-container☆11Apr 5, 2024Updated last year
- Faye Ellis, Hands-on AWS Troubleshooting (1127)☆10Jul 12, 2023Updated 2 years ago