Insight Data Engineering project: A platform built in HDFS, Spark and Airflow to help you to find social influencers from GitHub Network.
☆16May 21, 2024Updated last year
Alternatives and similar repositories for Git-Influencer
Users that are interested in Git-Influencer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Amazon QuickSight and Amazon Athena workshop. Workshop will focus on ingesting data into Athena, combining it with other data sources, an…☆11Sep 19, 2017Updated 8 years ago
- ☆15Jan 22, 2017Updated 9 years ago
- Example project for consuming AWS Kinesis streamming and save data on Amazon Redshift using Apache Spark☆11May 22, 2018Updated 7 years ago
- Utility to monitor AWS Redshift Performance☆12Jul 6, 2016Updated 9 years ago
- 😎 Awesome lists about all kinds of interesting topics☆10Feb 10, 2020Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Create a data pipeline on AWS to execute batch processing in a Spark cluster provisioned by Amazon EMR. ETL using managed airflow: extrac…☆10Jul 12, 2021Updated 4 years ago
- A repo to track data engineering projects☆13Nov 11, 2022Updated 3 years ago
- A static code analyzer to generate network connection topology for micro-service applications☆18May 1, 2026Updated last week
- Python Script to download hundreds of images from 'Google Images'. It is a ready-to-run code!☆14Sep 12, 2021Updated 4 years ago
- Project Search is a Recommendation system for Youtube videos and Amazon products.☆12May 10, 2017Updated 8 years ago
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …☆17Oct 1, 2019Updated 6 years ago
- A tool for modeling infectious diseases.☆18Apr 29, 2024Updated 2 years ago
- Tweepy Stream Example☆19Apr 23, 2019Updated 7 years ago
- Usage examples for byte-genie API☆12Apr 27, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This is a capstone project that entails building an end-to-end ETL (Extract-Transform-Load) Data pipeline which extracts UK accident and …☆18Jun 6, 2020Updated 5 years ago
- A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract d…☆24Nov 22, 2021Updated 4 years ago
- Internet's Most Popular Tutorials on Fresh-off-the-shelf ML & Data Science Technologies, Authored by Yours Truly.☆19Apr 29, 2020Updated 6 years ago
- ELT Code for your Data Warehouse☆26Sep 18, 2023Updated 2 years ago
- Built a stream processing data pipeline to get data from disparate systems into a dashboard using Kafka as an intermediary.☆29Aug 14, 2023Updated 2 years ago
- Demonstration of using Apache Spark to build robust ETL pipelines while taking advantage of open source, general purpose cluster computin…☆25Aug 11, 2023Updated 2 years ago
- ☆19Feb 2, 2020Updated 6 years ago
- A real-time event pipeline around Kafka Ecosystem for Chicago Transit Authority.☆32Aug 14, 2023Updated 2 years ago
- A real-time streaming ETL pipeline for streaming and performing sentiment analysis on Twitter data using Apache Kafka, Apache Spark and D…☆29Aug 8, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Find mapcodes in a string☆26Jul 24, 2022Updated 3 years ago
- COVID-19 Projections Data and Dashboard☆26Dec 8, 2022Updated 3 years ago
- Mconf's wiki: https://github.com/mconf/wiki/wiki☆13Apr 30, 2014Updated 12 years ago
- This is python web scraper implemented using multithreading/multiprocessing/pool for amazon.com☆28Sep 23, 2019Updated 6 years ago
- ☆10Dec 22, 2018Updated 7 years ago
- PyPi module for Graphlet AI Knowledge Graph Factory☆32Apr 1, 2023Updated 3 years ago
- A libcluster strategy for Digital Ocean Droplets☆12May 11, 2023Updated 2 years ago
- general-purpose fast, stateless, and deterministic feature extractor written in golang for use in machine learning☆12Mar 17, 2018Updated 8 years ago
- the full stack☆13Jun 16, 2015Updated 10 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Mock async JavaScript libraries☆22Jan 2, 2016Updated 10 years ago
- An implementation of the QUIC protocol in Elixir☆13Mar 17, 2019Updated 7 years ago
- Amazon Keyword Suggestion Tool in GoLang. Tool will generate relevant Amazon Product Keywords with the number of active products per each…☆50Jan 3, 2021Updated 5 years ago
- A Yeoman generator for creating a FeathersJS plugin.☆22Aug 16, 2021Updated 4 years ago
- Pure Elixir implementation of Sha3 and the original Keccak1600-f☆16Jan 20, 2026Updated 3 months ago
- epmd written in Elixir☆20Sep 24, 2014Updated 11 years ago
- Annotate your pictures online and save in different formats☆13Oct 4, 2023Updated 2 years ago