nakuleshj / news-nlp-pipeline
A fully serverless, event-driven data pipeline that ingests, enriches, validates, and visualizes real-time news data using AWS services. Designed for cost-efficient, scalable deployment using only free-tier AWS services.
☆23 Updated 3 months ago
Alternatives and similar repositories for news-nlp-pipeline
Users who are interested in news-nlp-pipeline are comparing it to the libraries listed below
- This repo is a comprehensive blueprint of how to use dbt to run data pipelines using databricks compute. It showcases modular project str… ☆67 Updated last month
- Quickstart for any service ☆167 Updated this week
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset ☆254 Updated last month
- Open Data Stack Platform: a collection of projects and pipelines built with open data stack tools for scalable, observable data platform… ☆20 Updated last month
- Dagster Labs' open-source data platform, built with Dagster. ☆414 Updated last week
- Houston orchestration API. callhouston.io ☆51 Updated 5 months ago
- The Open-Source Enterprise Data Platform in a single Portal ☆260 Updated this week
- Slow & local data allows you to move fast and deliver business value for the 99.9% of the data challenges. ☆323 Updated last month
- ☆211 Updated 10 months ago
- SQLMesh example projects ☆36 Updated 4 months ago
- All things awesome related to Dagster! ☆132 Updated last month
- Sling is a CLI tool that extracts data from a source storage/database and loads it in a target storage/database. ☆724 Updated this week
- Demo Project for Open Source MDS ☆167 Updated 2 months ago
- ☆138 Updated this week
- ☆163 Updated 6 months ago
- DuckDB for streaming data ☆688 Updated 2 months ago
- Demonstrating the capabilities of DuckDB as a transformation engine for data lakes ☆31 Updated last year
- Python package for querying iceberg data through duckdb. ☆70 Updated last year
- PyAirbyte brings the power of Airbyte to every Python developer. ☆306 Updated last week
- Python wrapper for the Sling CLI tool ☆60 Updated 3 weeks ago
- New generation opensource data stack ☆75 Updated 3 years ago
- Pushdown compute from Snowflake to DuckDB running on your infrastructure ☆195 Updated last month
- Example Dagster Cloud code for the Hooli Data Engineering organization. ☆16 Updated last month
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle… ☆121 Updated 7 months ago
- ☆336 Updated this week
- ☆30 Updated last year
- Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows. ☆1,247 Updated last week
- A lightweight Python-based tool for extracting and analyzing data column lineage for dbt projects ☆171 Updated this week
- An example of a Dagster project with a possible folder structure to organize the assets, jobs, repositories, schedules, and ops. Also has… ☆100 Updated last year
- Dagster University courses ☆116 Updated this week