Data transformation
☆23Apr 18, 2021Updated 5 years ago
Alternatives and similar repositories for data-refinery
Users that are interested in data-refinery are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A dashboard is worth a thousand words => https://datastudio.google.com/reporting/755f3183-dd44-4073-804e-9f7d3d993315☆28Oct 30, 2021Updated 4 years ago
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.☆20Oct 11, 2021Updated 4 years ago
- ETL flow framework based on Yaml configs in Python☆22Oct 21, 2023Updated 2 years ago
- A data pipeline moving data from a Relational database system (RDBMS) to a Hadoop file system (HDFS).☆15Jun 3, 2021Updated 4 years ago
- Schedule a data pipeline in Google Cloud using cloud function, BigQuery, cloud storage, cloud scheduler, stack trace, cloud build, and p…☆25Jun 4, 2019Updated 6 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Terraform module to create Fast.ai course instance.☆12Nov 27, 2017Updated 8 years ago
- implementing an end-to-end tweets ETL/Analysis pipeline.☆59Dec 8, 2022Updated 3 years ago
- ☆15Dec 2, 2020Updated 5 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆56May 6, 2023Updated 3 years ago
- ☆14Jul 18, 2023Updated 2 years ago
- Causal Inference Using Quasi-Experimental Methods☆21Jan 15, 2021Updated 5 years ago
- Vagrant Environment for creating a macOS Base Box☆10Nov 13, 2016Updated 9 years ago
- Deploy instantly on Serverless Application Repository☆12Nov 18, 2018Updated 7 years ago
- AI driven drum patterns 🥈 Runner up for "Most fun/Best easter egg" in Supabase's Launch Week 7 Hackathon☆16Apr 5, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆11Jun 17, 2024Updated last year
- Social Engineering for the Blue Team☆11Feb 1, 2024Updated 2 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆166Jun 16, 2020Updated 5 years ago
- Polars plugin for 256-bit (U256 and signed I256) integers backed by ruint☆14Oct 2, 2025Updated 7 months ago
- PyTorch - Albert Large V2, Bert Base Uncased, Bert Large Uncased WWM Finetuned Squad, Distil Roberta Base, Roberta Base Squad2, Roberta l…☆11Jul 10, 2020Updated 5 years ago
- Scripts for building Vagrant boxes for VMware Fusion that boot macOS☆14Feb 1, 2019Updated 7 years ago
- My personal website☆11Apr 8, 2026Updated last month
- A server side rendering framework for Deno CLI and Deploy. 🦟 🦕☆15Jun 22, 2022Updated 3 years ago
- EDA Tutorial for 2017 PyCon Portland☆13May 2, 2017Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- EU Budget for Results - Data Lake☆14Jan 26, 2023Updated 3 years ago
- The chrome browser controlled via puppeteer does not support switching proxies without restarting the browser. In this tutorial I show ho…☆12Dec 20, 2020Updated 5 years ago
- kdb+/q kalman beta matlab python☆11Sep 11, 2019Updated 6 years ago
- Automated Continuous Data Quality Measurement☆12Nov 15, 2023Updated 2 years ago
- A recommender system for GitHub repositories☆14Jun 21, 2014Updated 11 years ago
- This repository contains fine tuned BERT models☆12Jul 17, 2020Updated 5 years ago
- Dockerized openconnect client. Compatible with Cisco Anyconnect (CSD). Exposes socks5 proxy.☆14Oct 16, 2020Updated 5 years ago
- ☆10Jul 2, 2016Updated 9 years ago
- ☆10Oct 15, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Master complex big data processing, stream analytics, and machine learning with Apache Spark☆18Jan 30, 2023Updated 3 years ago
- ☆10Sep 9, 2017Updated 8 years ago
- We implement AI for Hearthstone using open source Hearthstone simulator FirePlace: https://github.com/jleclanche/fireplace.☆11Jun 12, 2016Updated 9 years ago
- Real Time Streaming using Apache Spark Streaming [Video], published by Packt☆10Oct 31, 2022Updated 3 years ago
- Weird experimental videomoshing experiment (weird)☆14Jul 20, 2020Updated 5 years ago
- embedded Perl 5 interpreter in Haskell, forked from https://github.com/perl6/Pugs.hs. Candidate package on hackage at https://hackage.has…☆12Feb 7, 2021Updated 5 years ago
- Skillset Challenge for the Apprenticeship Program, June 2021.☆11Jan 8, 2022Updated 4 years ago