airscholar / RealtimeStreamingEngineeringView on GitHub
This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenAI LLM, Kafka and Elasticsearch. It covers each stage from data acquisition, processing, sentiment analysis with ChatGPT, production to kafka topic and connection to elasticsearch.
44Jan 4, 2024Updated 2 years ago

Alternatives and similar repositories for RealtimeStreamingEngineering

Users that are interested in RealtimeStreamingEngineering are comparing it to the libraries listed below

Sorting:

Are these results useful?