chuqiaoshen/Git-Influencer

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/chuqiaoshen/Git-Influencer)

Insight Data Engineering project: a platform built on HDFS, Spark, and Airflow to help you find social influencers in the GitHub network.

☆16

Alternatives and similar repositories for Git-Influencer

Users that are interested in Git-Influencer are comparing it to the libraries listed below.

keiraqz / artmosphere
Data Engineering Project at Insight
☆15 • Nov 17, 2015 • Updated 10 years ago
drawrowfly / awesome
😎 Awesome lists about all kinds of interesting topics
☆10 • Feb 10, 2020 • Updated 6 years ago
anthonywong611 / Batch-ETL-with-AWS-EMR-and-MWAA
Create a data pipeline on AWS to execute batch processing in a Spark cluster provisioned by Amazon EMR. ETL using managed airflow: extrac…
☆10 • Jul 12, 2021 • Updated 5 years ago
aws-quickstart / quickstart-amazon-redshift
AWS Quick Start Team
☆23 • Oct 3, 2024 • Updated last year
shravan-kuchkula / dataEngineering
A repo to track data engineering projects
☆14 • Nov 11, 2022 • Updated 3 years ago
manojahi / Project-Search-A-Recommendation-system-for-Youtube-video-and-Amazon-Product-based-on-user-comments
Project Search is a Recommendation system for Youtube videos and Amazon products.
☆11 • May 10, 2017 • Updated 9 years ago
shravan-kuchkula / udacity-data-eng-proj4
Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …
☆17 • Oct 1, 2019 • Updated 6 years ago
alanchn31 / Loan-Default-Prediction
Loan Default Prediction using PySpark, with jobs scheduled by Apache Airflow and Integration with Spark using Apache Livy
☆22 • Dec 26, 2020 • Updated 5 years ago
byte-genie / examples-genie
Usage examples for byte-genie API
☆12 • Apr 27, 2024 • Updated 2 years ago
AdeboyeML / UK_Accident_Traffic_ETL_Pipeline
This is a capstone project that entails building an end-to-end ETL (Extract-Transform-Load) Data pipeline which extracts UK accident and …
☆18 • Jun 6, 2020 • Updated 6 years ago
cmu-db / dbgym
Infrastructure for researching self-driving databases
☆32 • Jul 2, 2025 • Updated last year
shravan-kuchkula / udacity-data-eng-proj3
Built a stream processing data pipeline to get data from disparate systems into a dashboard using Kafka as an intermediary.
☆29 • Aug 14, 2023 • Updated 2 years ago
SourabhSinghRana / real-time_crypto_data_pipeline_using_kafka
I am using confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti…
☆29 • May 2, 2023 • Updated 3 years ago
rayyan17 / jobAnalytics_and_search
JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
☆30 • Dec 8, 2022 • Updated 3 years ago
embulk / embulk-input-gcs
Embulk plugin that loads records from Google Cloud Storage
☆14 • Mar 15, 2025 • Updated last year
IndicoDataSolutions / clothing_similarity
Final and skeleton code for the clothing similarity walkthrough
☆10 • Jan 20, 2016 • Updated 10 years ago
guidok91 / spark-movies-etl
Spark data pipeline that processes movie ratings data.
☆31 • Updated this week
gtalarico / interactive-elastic-analyzer
Interactive Elasticsearch Analyzer
☆13 • Dec 8, 2022 • Updated 3 years ago
pran4ajith / spark-twitter-streaming
A real-time streaming ETL pipeline for streaming and performing sentiment analysis on Twitter data using Apache Kafka, Apache Spark and D…
☆29 • Aug 8, 2020 • Updated 5 years ago
yuorme / covid-projections
COVID-19 Projections Data and Dashboard
☆26 • Dec 8, 2022 • Updated 3 years ago
redis-developer / redis-om-retail
This repository contains several example sub-projects related to data modeling using Redis with Redis OM for Python
☆14 • Mar 2, 2022 • Updated 4 years ago
goldengrape / read_medical_device_data
☆10 • Dec 22, 2018 • Updated 7 years ago
garystafford / aws-airflow-demo
Project files for the post: Running PySpark Applications on Amazon EMR using Apache Airflow: Using the new Amazon Managed Workflows for A…
☆41 • Jul 6, 2022 • Updated 4 years ago
metachris / rfid-music-player
A simple RFID music player for kids (runs on a Raspberry Pi)
☆11 • Jun 30, 2017 • Updated 9 years ago
syuilo / seed-color
Generate random color from a seed
☆14 • Aug 5, 2020 • Updated 5 years ago
johnspackman / UploadMgr
Uploads files with background uploads and progress feedback on modern browsers
☆10 • Jun 3, 2026 • Updated last month
jsonmaur / libcluster-droplet
A libcluster strategy for Digital Ocean Droplets
☆12 • May 11, 2023 • Updated 3 years ago
dustin-decker / featuremill
general-purpose fast, stateless, and deterministic feature extractor written in golang for use in machine learning
☆12 • Mar 17, 2018 • Updated 8 years ago
Viddi / quic-elixir
An implementation of the QUIC protocol in Elixir
☆13 • Mar 17, 2019 • Updated 7 years ago
UBOdin / jsqlparser
UB's JSQLparser fork
☆12 • Nov 28, 2019 • Updated 6 years ago
abeleuta / easyannotation
Annotate your pictures online and save in different formats
☆14 • Oct 4, 2023 • Updated 2 years ago
IBMDeveloperUK / ML-For-Everyone
Resources, notebooks, assets for ML for Everyone Twitch stream
☆14 • Jul 8, 2020 • Updated 6 years ago
sethjuarez / tfjsmnist
☆16 • Jan 3, 2019 • Updated 7 years ago
aws-samples / analyzing-reddit-sentiment-with-aws
Learn how to use Kinesis Firehose, AWS Glue, S3, and Amazon Athena by streaming and analyzing reddit comments in realtime. 100-200 level …
☆45 • Apr 20, 2021 • Updated 5 years ago
Aiven-Labs / demo-opensearch-python
This repository contains code example in how to write search queries with OpenSearch Python client
☆10 • Sep 20, 2023 • Updated 2 years ago
Vingtoft / UNO-excel-to-pdf-converter
A small project that convert Excel (xlsx) files to PDF files and applies different styles to the PDF (landscape orientation, margins and …
☆12 • Jan 5, 2016 • Updated 10 years ago
r7kamura / rnes
A NES emulator written in Ruby.
☆10 • Nov 24, 2018 • Updated 7 years ago
Geddd / PATB4ASM80
This is a version of Li Chen Wang's Palo Alto Tiny BASIC 2.0 for use with the online 8080 emulator and assembler ASM80.com.
☆12 • Oct 10, 2020 • Updated 5 years ago
JayWood / jw-wpcli-random-posts
A robust random post generator for WP CLI which supports multisite, post types, post counts, taxonomies, terms, term counts and featured …
☆57 • Jun 16, 2023 • Updated 3 years ago