sudarshan-koirala / 30-days-of-Databricks
30 days of Databricks is a step-by-step guide for complete beginners to learn Databricks in 30 days.
☆10Updated last year
Related projects
Alternatives and complementary repositories for 30-days-of-Databricks
- An end-to-end data engineering pipeline that fetches real-time YouTube analytics and streams them through Kafka for processing with ksqlD…☆10Updated last year
- This repository contains the code for a real-time election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆29Updated 11 months ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆29Updated 10 months ago
- A Retrieval-Augmented Generation (RAG) application for querying legal documents. It uses PostgreSQL, Elasticsearch, and LLM to provide su…☆44Updated 2 months ago
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en…☆15Updated 9 months ago
- Practical LangChain tutorials for LLM applications development☆131Updated last month
- In this project, we set up end-to-end data engineering using Apache Spark, Azure Databricks, and Data Build Tool (dbt), using Azure as our …☆22Updated 10 months ago
- ☆15Updated 7 months ago
- Writes a CSV file to Postgres, reads the table, and modifies it. Writes more tables to Postgres with Airflow.☆35Updated last year
- ☆42Updated 3 months ago
- ☆35Updated 10 months ago
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆63Updated last year
- This project is a Streamlit-based chat application that interacts with the Gemini AI model, allowing users to engage in conversations wit…☆52Updated 3 months ago
- Various projects using large language models (GPT and LLaMA) and other open-source models from HuggingFace and OpenAI. OpenAI API required for ru…☆86Updated 2 months ago
- ☆10Updated 6 months ago
- ☆14Updated 3 months ago
- Local SQL Database ---> Azure ---> Power BI☆11Updated last year
- Course Material - Data Science Program☆13Updated last year
- Practical step-by-step LangChain guides☆27Updated 7 months ago
- How to stream the generation of an LLM in your Streamlit application☆21Updated 6 months ago
- This project is about building a dimensional data warehouse in BigQuery by transforming an OLTP system to an OLAP system, using dbt as ou…☆10Updated 11 months ago
- Project bike sharing predictor☆54Updated last month
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆197Updated last year
- Repo for advanced RAG evaluation on French legal code data☆17Updated 7 months ago
- ☆49Updated 11 months ago
- Chat-with-Everything is a series of articles aimed at developers who are interested in learning about and building applications with LLMs☆42Updated last month
- A pipeline to detect data drift and retrain the model when there is drift☆22Updated last year
- ☆11Updated 5 months ago
- ☆14Updated 7 months ago
- This repository will contain the presentation and Python Jupyter notebooks for the DataHack Summit 2024 conference talk, Improving Real-w…☆85Updated last month