β46Jun 10, 2024Updated last year
Alternatives and similar repositories for data_engineering_best_practices
Users that are interested in data_engineering_best_practices are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Contribute to dlt verified sources π₯β113Mar 30, 2026Updated last month
- A framework to manage data, continuouslyβ35Jan 20, 2025Updated last year
- Sample project to get started with dbt-power-user vscode extension using dev-containerβ12Apr 5, 2024Updated 2 years ago
- A HTML previewer for Visual Studio Codeβ15Nov 29, 2018Updated 7 years ago
- This connector is a dbt project that maps Medicare CCLF claims data to the Tuva Input Layer.β14Apr 16, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Model Context Protocol (MCP) servers with Claude Code. These tools dramatically enhance Claude Code's capabilities, allowing it to interaβ¦β28Aug 23, 2025Updated 9 months ago
- Zhongwen Tools, tools for dealing with Chinese--pinyin, trad-simp conversion and more.β13Feb 25, 2026Updated 3 months ago
- β22Sep 1, 2025Updated 8 months ago
- Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those pluβ¦β60Mar 29, 2023Updated 3 years ago
- Email digest aggregator for fail2banβ13Oct 9, 2024Updated last year
- The Postgres adapter for Harlequin, the SQL IDE for your Terminalβ20Apr 20, 2026Updated last month
- RStudio addin to paste last value as comment in codeβ11May 24, 2021Updated 5 years ago
- End to end data pipeline to extract and analyze submissions from any subreddit using Pushshift, python, dbt and BigQuery.β12Jul 17, 2023Updated 2 years ago
- π A sweet and speedy code generator for dbt ποΈβ¨β32Jan 23, 2026Updated 4 months ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- β‘ Blazing fast async/await HTTP client for Python written on Rust using reqwestsβ32Apr 16, 2023Updated 3 years ago
- A dbt package to enhance data privacyβ22May 7, 2026Updated 2 weeks ago
- R imgur API Clientβ16Apr 22, 2018Updated 8 years ago
- An Ibis back-end for the GizmoSQL Arrow Flight SQL Server (with the DuckDB engine)β16May 10, 2026Updated 2 weeks ago
- Social Watcher on Facebook Marketing APIβ10Jul 20, 2022Updated 3 years ago
- Integrating Apache Airflow, dbt, Great Expectations and Apache Superset to develop a modern open source data stack.β18Jun 19, 2022Updated 3 years ago
- Javascript for rendering CSS animations of Chinese character stroke dataβ14Dec 14, 2015Updated 10 years ago
- My custom Twitch integrations built with Effectβ17Jul 7, 2025Updated 10 months ago
- Hexagonal (ports and adapters) architecture applied to Spark and Python data engineering projectβ33Jul 26, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Data on international first names and sex of people with that nameβ13Jan 12, 2019Updated 7 years ago
- β11Nov 21, 2023Updated 2 years ago
- The go to demo for public and private dbt Learnβ84Mar 17, 2026Updated 2 months ago
- data load tool (dlt) is an open source Python library that makes data loading easy π οΈβ5,365Updated this week
- β50Mar 10, 2026Updated 2 months ago
- Real-time Credit card Fraud detection using Spark Streaming, Spark ML, Spark SQL, Kafka, Cassandra and Airflowβ11Jul 1, 2022Updated 3 years ago
- A small dataset that contains information related to Columbo - the American crime drama television series starring Peter Falk.β11Aug 9, 2022Updated 3 years ago
- A guide for leading a data (engineering) teamβ65May 7, 2024Updated 2 years ago
- R package based on ETL framework to interface with NYC CitiBike dataβ13May 20, 2019Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Professional Gemini API integration for Claude and all MCP-compatible hosts with intelligent model selection and advanced file handling |β¦β29Mar 17, 2025Updated last year
- This workshop will familiarize you with some of the key steps towards building an autonomous driving data lake and extracting images fromβ¦β10Jul 12, 2022Updated 3 years ago
- A package for interfacing with Docker from RStudio, including generating dockerfiles and allowing users to build/push/tag/run docker imagβ¦β18Jun 9, 2025Updated 11 months ago
- Generate subtitles (.srt file) from an audio/video file. It uses the faster-whisper library, which is much faster than the OpenAI originaβ¦β22Nov 8, 2023Updated 2 years ago
- Journeys between the two worlds of Python π and Rust π¦β44May 18, 2026Updated last week
- An Azure Bicep local-deploy extension for Azure DevOpsβ37Mar 29, 2026Updated last month
- β10Jul 20, 2020Updated 5 years ago