getindata / quickstart-ml-blueprints
Data science project development best practices and state-of-the-art open-source tooling, forged into a set of solved ML use cases that serve as blueprints for efficient prototyping.
☆15Updated last year
Related projects:
- ☆11Updated 7 months ago
- NiFi Processor for Apache Pulsar☆10Updated 6 months ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆102Updated this week
- Full stack data engineering tools and infrastructure set-up☆38Updated 3 years ago
- Recipes of publicly-available Jupyter images☆8Updated last week
- Test data management tool for any data source, batch or real-time☆35Updated last week
- Set up a Cost-Effective Modern Data Stack for a Charity☆18Updated 6 months ago
- Python+VueJS application to load, explore, combine, transform and deliver data☆67Updated last week
- DuckDB Docker image☆18Updated this week
- ☆28Updated 9 months ago
- Personal finance project to automatically collect Swiss banking transactions into a DWH and visualise them☆26Updated 6 months ago
- A tool to automatically infer column data types in .csv files☆33Updated last year
- rust-for-data☆42Updated last year
- CLI to create an ER Diagram from DuckDB database files☆59Updated last week
- ☆12Updated 11 months ago
- Sample project that uses Dagster, dbt, DuckDB and Dash to visualize the Spanish car and motorcycle market☆52Updated last year
- Delta reader for the Ray open-source toolkit for building ML applications☆40Updated 7 months ago
- Hands-on workshop with Iceberg, Redpanda, Debezium and Kafka-Connect☆12Updated 6 months ago
- Data Tools Subjective List☆80Updated last year
- Geospatial clustering at massive scale☆94Updated 2 months ago
- A tool for generating docker-compose environments☆19Updated 3 months ago
- ☆17Updated 2 years ago
- End-to-end data engineering project to get insights from PyPI using Python, DuckDB, MotherDuck & Evidence☆122Updated 2 weeks ago
- A write-audit-publish implementation on a data lake without the JVM☆39Updated last month
- Guide to data platforms and tools☆31Updated 2 years ago
- Intended for internal use: deploys all infrastructure required for Astronomer to run on GCP☆10Updated last month
- Building 3D Trusted Data Pipelines With Dagster, dbt, and DuckDB☆18Updated last year
- Trying out Rust☆11Updated last year
- Unity Catalog UI☆40Updated 2 weeks ago
- A few end to end examples that use data-describe☆16Updated last year