This project predicts smartphone prices by combining batch and stream processing in a Big Data environment. It follows the Lambda Architecture pattern, giving users both real-time and batch processing capabilities.
☆25 · Apr 15, 2024 · Updated last year
Alternatives and similar repositories for Big-Data-Project
Users that are interested in Big-Data-Project are comparing it to the libraries listed below.
- This project implements a real-time data pipeline using Apache Kafka, Python's psutil library for metric collection, and SQL Server for d… ☆15 · Oct 31, 2023 · Updated 2 years ago
- The project aims to automate the extraction of data from a YouTube channel, transform the data into a suitable format, and make it availa… ☆12 · Sep 24, 2023 · Updated 2 years ago
- Sample ecommerce website. ☆10 · Feb 11, 2022 · Updated 4 years ago
- Feature demos, integration guides & hands-on labs/projects using Kpow, Flex, Kafka, Flink, Iceberg & more ☆51 · Mar 30, 2026 · Updated last week
- Dynamic delta hedging (DDH) is a trading strategy that involves hedging a non-linear position with linear instruments. Linear instruments… ☆16 · Nov 24, 2023 · Updated 2 years ago
- Get information about Indian Mutual Funds from their ISIN numbers. ☆44 · Mar 27, 2026 · Updated last week
- I am using confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti… ☆29 · May 2, 2023 · Updated 2 years ago
- ☆59 · Apr 2, 2025 · Updated last year
- Building a highly scalable Machine Learning System ☆27 · Dec 3, 2024 · Updated last year
- VN Stock Advisor is an intelligent stock analysis tool utilizing CrewAI's Multi-AI-Agent system. ☆63 · Jun 14, 2025 · Updated 9 months ago
- Built and deployed scalable LLM retrieval APIs on a hybrid GCP architecture with full CI/CD, IaC, and monitoring ☆76 · Aug 10, 2025 · Updated 7 months ago
- ☆64 · Nov 24, 2024 · Updated last year
- This repository consists of Java Program for Employee Management System. ☆109 · May 15, 2024 · Updated last year
- Natural language guided image captioning ☆87 · Feb 11, 2024 · Updated 2 years ago
- Practical guide to build end-to-end machine learning pipeline and deploy your model in production ☆85 · Aug 28, 2023 · Updated 2 years ago
- ☆70 · Aug 15, 2024 · Updated last year
- Streamly - Streamlit Assistant is designed to provide the latest updates from Streamlit, generate code snippets for Streamlit widgets, an… ☆115 · Jul 25, 2024 · Updated last year
- ML Research Resources ☆78 · May 24, 2022 · Updated 3 years ago
- React Cookbook, published by Packt ☆133 · Feb 24, 2023 · Updated 3 years ago
- Data Engineering Project with Hadoop HDFS and Kafka ☆124 · Nov 4, 2023 · Updated 2 years ago
- An MCP Multimodal AI Agent with eyes and ears! ☆552 · Jan 5, 2026 · Updated 3 months ago
- A universal CLI toolkit for AI agent skills, enabling structured AI-assisted development across tools like Cursor, Claude Code, Codex, an… ☆1,073 · Updated this week
- This repo contains implementation of 25+ prompt engineering techniques. ☆441 · Dec 18, 2025 · Updated 3 months ago
- End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021) ☆229 · Jan 3, 2024 · Updated 2 years ago
- A turnkey MLOps pipeline demonstrating how to go from raw events to real-time predictions at scale. ☆243 · Oct 21, 2025 · Updated 5 months ago
- A curated learning repository focused on High-Performance Computing (HPC) — covering fundamentals to advanced topics in CUDA, MPI, C++, a… ☆361 · Mar 22, 2026 · Updated 2 weeks ago
- The resources of the preparation course for Databricks Data Engineer Associate certification exam ☆622 · Dec 26, 2025 · Updated 3 months ago
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka… ☆323 · Feb 14, 2025 · Updated last year
- Data Science Handbook ☆297 · Jan 4, 2022 · Updated 4 years ago
- Slides, scripts and materials for the Machine Learning in Finance Course at NYU Tandon, 2022 ☆548 · Dec 11, 2022 · Updated 3 years ago
- ☆390 · Jan 26, 2025 · Updated last year
- Code release for ActionFormer (ECCV 2022) ☆553 · Apr 11, 2024 · Updated last year
- Case Recommender: A Flexible and Extensible Python Framework for Recommender Systems ☆501 · Jan 10, 2024 · Updated 2 years ago
- A Vietnamese natural language processing toolkit (NAACL 2018) ☆665 · Feb 12, 2023 · Updated 3 years ago
- Demystify AI agents by building them yourself. Local LLMs, no black boxes, real understanding of function calling, memory, and ReAct patt… ☆3,361 · Jan 21, 2026 · Updated 2 months ago
- Getting Started with Data Engineering ☆1,319 · Apr 20, 2025 · Updated 11 months ago
- List of software (HW interfaces, libs, protocols, etc) specifically suitable for resource-constrained Embedded Systems (low-memory and l… ☆1,026 · Mar 11, 2026 · Updated 3 weeks ago
- Example end to end data engineering project. ☆1,401 · Dec 8, 2022 · Updated 3 years ago
- Build an email assistant with human-in-the-loop and memory ☆1,548 · Oct 20, 2025 · Updated 5 months ago