Fully unit tested utility functions for data engineering. Python 3 only.
☆18Feb 11, 2026Updated 2 weeks ago
Alternatives and similar repositories for dataengineeringutils3
Users that are interested in dataengineeringutils3 are comparing it to the libraries listed below
Sorting:
- Python version of dbtools☆12Jul 30, 2025Updated 7 months ago
- Parent repository for the MOJ Analytics Platform☆14Nov 16, 2021Updated 4 years ago
- ☆13Oct 18, 2023Updated 2 years ago
- A python package to create a database on the platform using our moj data warehousing framework☆21Feb 11, 2026Updated 2 weeks ago
- Terraform module which creates Snowflake RBAC resources using a simple configuration model. DISCLAIMER: Please see the following module t…☆12Jul 3, 2023Updated 2 years ago
- ⚠️ Templates of tools to help prevent committing sensitive data to github☆33Jan 6, 2021Updated 5 years ago
- NLP tool for scraping text from a corpus of PDF files, embedding the sentences in the text and finding semantically similar sentences to …☆37Jun 22, 2022Updated 3 years ago
- Package to read out BME280 sensor on Raspberry Pi☆13Nov 27, 2025Updated 3 months ago
- ☆10Aug 23, 2023Updated 2 years ago
- The ONS Big Data Team Github pages☆10May 19, 2021Updated 4 years ago
- All in one solution for Keycloak deployment into VPS by using Docker-compose, Nginx, Certbot and SSL☆11May 4, 2025Updated 9 months ago
- I will store here a bunch of home assignments that I got (without company names) and their solutions.☆12Feb 21, 2024Updated 2 years ago
- A toolkit of functions and classes to help build isometric games with Lua☆16Apr 21, 2025Updated 10 months ago
- Reproducible Analytical Pipeline of the Hospital Standardised Mortality Ratio (HSMR) quarterly publication☆11Jun 21, 2024Updated last year
- A course about terraform☆11Apr 13, 2021Updated 4 years ago
- The privacy-preserving record linkage toolkit: a proof-of-concept public demo of next-gen data linkage techniques.☆15May 22, 2024Updated last year
- ☆12Sep 10, 2025Updated 5 months ago
- Code for the paper "Match, Compare, or Select? An Investigation of Large Language Models for Entity Matching" (COLING 2025)☆19Jan 3, 2026Updated last month
- ☆15Feb 11, 2026Updated 2 weeks ago
- ☆11Jan 28, 2019Updated 7 years ago
- Interactive notebooks containing demonstration code of the splink library☆40Jan 19, 2024Updated 2 years ago
- A fastmcp server for open budget project☆13Jan 13, 2026Updated last month
- Demo of an In-database processing tool for scikit-learn☆13Oct 18, 2022Updated 3 years ago
- Flake8 plugin to lint for backwards incompatible database migrations☆12Feb 20, 2026Updated last week
- ☆12Feb 21, 2022Updated 4 years ago
- A thin wrapper around the AJV JSON Validator for Python☆12May 5, 2024Updated last year
- A Mixture‑of‑Experts Educational Framework for Adaptive Cybersecurity☆20Feb 8, 2026Updated 2 weeks ago
- A simple python library to spot holiday "bridges" and long weekends.☆10Aug 19, 2021Updated 4 years ago
- Code to implement the network histogram (Olhede and Wolfe, arXiv:1312.5306)☆11Sep 23, 2014Updated 11 years ago
- Learning GitLab, published by Packt☆13Jan 18, 2021Updated 5 years ago
- Data pipeline to extract and preprocess BigQuery user journey data.☆13Jun 16, 2022Updated 3 years ago
- Plug and play Heroicons and Tabler icons for Django Cotton.☆16Updated this week
- ☆46Feb 12, 2026Updated 2 weeks ago
- AWS Glue Configurable Test Data Generator for S3 Data Lakes and DynamoDB☆18Jan 19, 2026Updated last month
- Terraform ECS module☆15Aug 22, 2022Updated 3 years ago
- A tiny library to make writing CBV-based APIs easier in Django.☆12Aug 23, 2024Updated last year
- Working paper and notebook for unsupervised document clustering☆13Mar 6, 2018Updated 7 years ago
- ☆17Dec 2, 2025Updated 2 months ago
- textual tactics game☆10Sep 3, 2022Updated 3 years ago