Tough and flexible tools for data analysis, transformation, validation and movement.
☆142Jan 26, 2024Updated 2 years ago
Alternatives and similar repositories for DataGristle
Users that are interested in DataGristle are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆62Nov 30, 2022Updated 3 years ago
- python automatic data quality check toolkit☆277Sep 15, 2020Updated 5 years ago
- An ASCII-art based data flow management system☆18Dec 11, 2017Updated 8 years ago
- pyfpds is a python wrapper around the FPDS ATOM feed☆13Mar 1, 2019Updated 7 years ago
- ☆15Mar 14, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Publication: Linked electronic health records for research on a nationwide cohort including over 54 million people in England☆18Mar 12, 2023Updated 3 years ago
- A data structure to track data over time. It works by tracking time/schedule information rather than tracking data changes over time.☆13Jan 19, 2024Updated 2 years ago
- A web-based version of the codebook, which generates a concise summary of every variable in a dataset.☆14Apr 9, 2022Updated 4 years ago
- R htmlwidget for jQuery QueryBuilder filtering of data frames☆12Sep 26, 2019Updated 6 years ago
- A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard.☆256Dec 19, 2025Updated 5 months ago
- a dataset index☆23Feb 19, 2014Updated 12 years ago
- A lightweight command line benchmarking utility☆13Jul 3, 2021Updated 4 years ago
- A traits based data validation module for pandas data structures.☆16Nov 16, 2016Updated 9 years ago
- A machine learning library in Rust from scratch.☆51Dec 31, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A Singer.io Target for Snowflake☆11Jun 9, 2023Updated 3 years ago
- ☆15Jul 24, 2024Updated last year
- [NOT MAINTAINED] Bubbles – Python ETL framework☆462Oct 4, 2017Updated 8 years ago
- Materialize plugin for dbt☆12Jan 25, 2021Updated 5 years ago
- CubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED)☆28Jul 6, 2022Updated 3 years ago
- Streaming analytics project with eventsim and Kafka☆13Dec 23, 2022Updated 3 years ago
- For managing 2P imaging datasets from preprocessing to activity trace extraction☆10Apr 12, 2019Updated 7 years ago
- pyspark methods to enhance developer productivity 📣 👯 🎉☆687Jun 9, 2026Updated last week
- dbt adwords models☆18Dec 17, 2025Updated 6 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Repository for Data Engineering Interview Series☆40Oct 17, 2024Updated last year
- A Pandas Styler class for making beautiful tables☆413Jan 8, 2023Updated 3 years ago
- Python ETL and Data Warehouse☆34Oct 5, 2015Updated 10 years ago
- The papy package provides an implementation of the flow-based programming paradigm in Python☆35Mar 28, 2026Updated 2 months ago
- Collaborative space for the Resources Working Group of the Developer Relations Foundation.☆18Sep 19, 2025Updated 8 months ago
- A key-value based store for all your things☆11Nov 28, 2017Updated 8 years ago
- A Singer.io tap for extracting data from the Xero API.☆22Jun 10, 2026Updated last week
- A selectable, scrollable list interface for terminal applications built using curses☆10Jun 30, 2015Updated 10 years ago
- events - tools, libraries & scripts, schemas & formats - (incl. whatson, rubyconf, pycon, beerfest & more)☆11Oct 6, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Update a Google Data Catalog tag with dbt Cloud run metadata☆22Jan 19, 2021Updated 5 years ago
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …☆17Oct 1, 2019Updated 6 years ago
- Data build tool model for replicating 3 Google Analytics reports using BigQuery GA export data.☆15Sep 25, 2019Updated 6 years ago
- An open-source celebration of creative academics & academic creatives☆14Dec 9, 2017Updated 8 years ago
- Provides an API for working with Taskpaper formatted documents in Python.☆30Mar 17, 2011Updated 15 years ago
- Final Project for Data Engineering Zoomcamp Course 2024 🧙🔥☆11Apr 17, 2024Updated 2 years ago
- Rust And Delta Demo. Explanation and walkthrough on delta-rs☆10Aug 21, 2023Updated 2 years ago