Insight Data Engineering project: A platform built on HDFS, Spark, and Airflow to help you find social influencers in the GitHub network.
☆16May 21, 2024Updated last year
Alternatives and similar repositories for Git-Influencer
Users that are interested in Git-Influencer are comparing it to the libraries listed below.
- Utility to monitor AWS Redshift Performance☆12Jul 6, 2016Updated 9 years ago
- Create a data pipeline on AWS to execute batch processing in a Spark cluster provisioned by Amazon EMR. ETL using managed airflow: extrac…☆10Jul 12, 2021Updated 4 years ago
- Questions from others and my answers to them☆10Jan 27, 2024Updated 2 years ago
- A repo to track data engineering projects☆13Nov 11, 2022Updated 3 years ago
- AWS Quick Start Team☆23Oct 3, 2024Updated last year
- AWS KR Tech Blog: 'A Practical Guide to Building a Multimodal RAG Chatbot in 30 Minutes with Amazon Bedrock' sample code☆13Feb 15, 2025Updated last year
- Tweepy Stream Example☆19Apr 23, 2019Updated 6 years ago
- Loan Default Prediction using PySpark, with jobs scheduled by Apache Airflow and Integration with Spark using Apache Livy☆22Dec 26, 2020Updated 5 years ago
- Use an AWS Glue Python Shell Job to connect to your Amazon Redshift cluster and execute a SQL script stored in Amazon S3.☆21Aug 8, 2022Updated 3 years ago
- ELT Code for your Data Warehouse☆26Sep 18, 2023Updated 2 years ago
- Spark data pipeline that processes movie ratings data.☆31Apr 1, 2026Updated 2 weeks ago
- Jupyter Hub Support in VS Code☆17Apr 2, 2026Updated 2 weeks ago
- Interactive Elasticsearch Analyzer☆13Dec 8, 2022Updated 3 years ago
- Embulk plugin that loads records from Google Cloud Storage☆14Mar 15, 2025Updated last year
- This repository contains all the example code to help you build a content aggregator like serverless land. It is split into 2 components:…☆39Sep 23, 2025Updated 6 months ago
- Stream smartphone data with FastAPI, Kafka, QuestDB, and Docker.☆27Sep 23, 2023Updated 2 years ago
- Project files for the post: Running PySpark Applications on Amazon EMR using Apache Airflow: Using the new Amazon Managed Workflows for A…☆41Jul 6, 2022Updated 3 years ago
- A tool for manual conversion of BGE-M3 models with preserved trainable variables and direct control over model outputs.☆44Sep 7, 2025Updated 7 months ago
- A Python web scraper for amazon.com implemented using multithreading, multiprocessing, and pools☆28Sep 23, 2019Updated 6 years ago
- Analyzing shifting trends in music through the ages☆13Mar 25, 2021Updated 5 years ago
- A simple RFID music player for kids (runs on a Raspberry Pi)☆11Jun 30, 2017Updated 8 years ago
- Generate random color from a seed☆14Aug 5, 2020Updated 5 years ago
- Letter avatar is an Angular 2 directive that generates an avatar from the given text☆15Oct 31, 2019Updated 6 years ago
- UB's JSQLparser fork☆12Nov 28, 2019Updated 6 years ago
- Pure Elixir implementation of Sha3 and the original Keccak1600-f☆16Jan 20, 2026Updated 2 months ago
- A collection of remark plugins used by HashiCorp to process markdown☆16Aug 22, 2025Updated 7 months ago
- Learn how to use Kinesis Firehose, AWS Glue, S3, and Amazon Athena by streaming and analyzing reddit comments in realtime. 100-200 level …☆45Apr 20, 2021Updated 4 years ago
- Python writable in-memory virtual filesystem for SQLite☆17Jan 6, 2024Updated 2 years ago
- A small project that converts Excel (xlsx) files to PDF files and applies different styles to the PDF (landscape orientation, margins and …☆12Jan 5, 2016Updated 10 years ago
- Jupyter notebook + Code for scraping AngelList data and making an interactive chart of SFBA salaries/equity☆14Jun 1, 2016Updated 9 years ago
- SQL Query Builder/Visualizer☆19Apr 3, 2019Updated 7 years ago
- ✋ Stop propagation for everyday events with Angular directives 🎩☆13Feb 4, 2018Updated 8 years ago
- jgtextrank: Yet another Python implementation of TextRank☆13Nov 27, 2019Updated 6 years ago
- 🥪💾 A sample of data from the `jaffle-shop-generator` that powers the Jaffle Shop spanning one year.☆16Jan 23, 2025Updated last year
- Embed & Showcase your projects on websites☆17Jun 30, 2022Updated 3 years ago
- Semaphore demo CI/CD pipeline using Docker Compose and Python Flask☆13Jan 26, 2024Updated 2 years ago
- PySpark functions and utilities with examples. Assists the ETL process of data modeling☆104Dec 3, 2020Updated 5 years ago
- This repository contains code associated with an AWS blog post that demonstrates how you can accept API keys as a query string parameter in…☆10Feb 18, 2022Updated 4 years ago
- PHP serialize/unserialize functionality for Elixir☆13Dec 14, 2021Updated 4 years ago