A python package to create a database on the platform using our moj data warehousing framework
☆21Mar 16, 2026Updated last month
Alternatives and similar repositories for etl_manager
Users that are interested in etl_manager are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Interactive notebooks containing demonstration code of the splink library☆41Mar 4, 2026Updated last month
- The classic Titanic data science problem solved using Tableau Prep running Python scripts.☆11Nov 15, 2019Updated 6 years ago
- User guidance for the MoJ Analytical Platform☆15Mar 30, 2026Updated 2 weeks ago
- Fully unit tested utility functions for data engineering. Python 3 only.☆18Apr 8, 2026Updated last week
- Parent repository for the MOJ Analytics Platform☆14Nov 16, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A CLI to manage and monitor permissions in AWS Lake Formation☆25Feb 8, 2023Updated 3 years ago
- A pyspark lib to validate data quality☆19Nov 11, 2022Updated 3 years ago
- Python version of dbtools☆12Jul 30, 2025Updated 8 months ago
- [Deprecated] This solution helps customers reduce operational complexity and enables administrators to quickly create manual, event-based…☆14Mar 8, 2023Updated 3 years ago
- HDF masterclass materials☆29Mar 28, 2016Updated 10 years ago
- This project aims to load the UK rail timetable and station data provided by the Association of Train Operating Companies (at data.atoc.o…☆10May 28, 2022Updated 3 years ago
- This project is an example of using AWS Step functions to manage and track a series of AWS Batch jobs in N_TO_N mode.☆15Jan 20, 2026Updated 2 months ago
- Google Cloud Platform solution that provides an event driven process that flattens (unnests) Google Analytics 360 data that has been expo…☆16Updated this week
- A Scalable Data Cleaning Library for PySpark.☆29Apr 4, 2019Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- An ever growing collection of patterns for re-use☆11Feb 8, 2015Updated 11 years ago
- A small, useful collection of pandoc filters☆13Apr 5, 2025Updated last year
- Automate Uploads to Cloud Storage using Rclone☆14Jul 7, 2020Updated 5 years ago
- https://how.wtf source code☆21Jul 6, 2024Updated last year
- Utilities to translate your schema into anything, as long as 'anything' is JSON Schema.☆15Jun 24, 2022Updated 3 years ago
- Fast basic data structures for R☆11Apr 6, 2015Updated 11 years ago
- CloudFormation Cross Stack Reference Mapping☆13May 16, 2017Updated 8 years ago
- AWS Lambda Layer for Rclone☆18Updated this week
- Repo that we use for non-repo-specific stories and other shared stuff.☆22Apr 2, 2013Updated 13 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- WIP - agentic chat application template built with TypeScript, Next.js, TailwindCSS, and Shadcn.☆49Mar 24, 2026Updated 3 weeks ago
- ⚙️ Converts NDJson format data into CSV☆21Dec 15, 2024Updated last year
- ☆13Mar 4, 2022Updated 4 years ago
- Anonymizing Library for Apache Spark☆31Nov 9, 2023Updated 2 years ago
- An Excel formula parser☆12Mar 3, 2019Updated 7 years ago
- Demo App that uses GRANDstack to visualize Harry Potter network☆18Sep 19, 2021Updated 4 years ago
- react-native as an engine to drive share extension☆15Jul 16, 2018Updated 7 years ago
- Sprinkle is a volume clustering utility based on RClone. It presents all the RClone available volumes as a single clustered volume. It su…☆20Jun 14, 2024Updated last year
- A collection of python utility functions☆11Mar 30, 2026Updated 2 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Gitstats application for OpenCPU☆12May 13, 2024Updated last year
- Perform Bayesian record linkage with a one-to-one matching assumption.☆11Jul 9, 2020Updated 5 years ago
- GitHub template for small projects.☆22Mar 6, 2026Updated last month
- Introduction to Modern Data Analytics Tools Docker, Airbyte, DBT, Apache Superset with Brazilian Ecommerce Data & Applying RFM in DBT☆13Sep 8, 2022Updated 3 years ago
- Direktiv Application Containers☆19Sep 4, 2023Updated 2 years ago
- Collect and aggregate on spark events for profitz☆10Apr 22, 2022Updated 3 years ago
- Fast, reliable and intuitive object mapping.☆22Dec 2, 2021Updated 4 years ago