brittonlaroche / Confluent-Kafka-Vector-Encoding
Learn the first step in Retrieval-Augmented Generation (RAG), how to vector encode incoming data to insert and continuously update your vector database from enterprise data sources and systems.
☆30Updated 3 months ago
Alternatives and similar repositories for Confluent-Kafka-Vector-Encoding
Users that are interested in Confluent-Kafka-Vector-Encoding are comparing it to the libraries listed below
Sorting:
- Data Engineering with AWS Cookbook, published by Packt☆18Updated 5 months ago
- ☆30Updated 9 months ago
- ☆33Updated 2 weeks ago
- GitHub repository related to the course Mastering Elastic Map Reduce for Data Engineers☆24Updated 2 years ago
- Simplify Big Data Analytics with Amazon EMR, published by Packt☆13Updated 2 years ago
- Simplifying Data Engineering and Analytics with Delta, published by Packt☆21Updated last year
- code snippet for analytics sessions☆34Updated 3 years ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆27Updated 8 months ago
- Creating Amazon Bedrock agents with Streamlit Framework☆116Updated 2 months ago
- ☆10Updated 3 years ago
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.☆50Updated last year
- ☆65Updated 2 weeks ago
- ☆75Updated 8 months ago
- ☆24Updated last week
- Learn how to build Agentic Workflows on AWS☆44Updated this week
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆44Updated 2 years ago
- ☆46Updated 7 months ago
- ☆33Updated last year
- ☆81Updated 4 months ago
- Repository for code examples from my youtube channel and medium articles working with data in python on AWS☆27Updated last year
- ☆28Updated last year
- PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it☆64Updated 2 weeks ago
- This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenario…☆24Updated 6 months ago
- ☆27Updated 5 months ago
- Companion repository for the book 'Delta Lake Up and Running'☆47Updated last month
- This Repository contains the Demo Script, Code for all the sessions which I will be doing in Year 2024☆9Updated 4 months ago
- Serverless ETL and Analytics with AWS Glue, published by Packt☆48Updated last year
- The resources of the preparation course for Databricks Data Engineer Professional certification exam☆114Updated last month
- ☆12Updated last year
- ☆20Updated 6 months ago