yennanliu / web_scrapingView external linksLinks
Collect/process data via various data sources : website / js website / API. Run scrapping pipeline via Celery, and Travis cron task. Dump the scraped data to slack
☆14Jul 24, 2024Updated last year
Alternatives and similar repositories for web_scraping
Users that are interested in web_scraping are comparing it to the libraries listed below
Sorting:
- Stream/batch system with Hadoop, Spark on NYC taxi data | #DE☆26Sep 27, 2025Updated 4 months ago
- This is a demo project to compare two web scrapping frameworks, Playwright and Selenium and using the new Pipelining tool Dagster☆15Sep 9, 2021Updated 4 years ago
- A shell script to automate the operations of sqoop☆11Mar 29, 2021Updated 4 years ago
- Flight computer software for UCLA's rocket team.☆13Mar 18, 2017Updated 8 years ago
- Compass is a suite of tools for students enrolled in the University of London's online BSc in Computer Science program.☆11Feb 22, 2023Updated 2 years ago
- OBS browser source for displaying "now playing" from Last.fm or Spotify, scrolls on long titles. Not affiliated with OBS, please do not a…☆12Mar 13, 2023Updated 2 years ago
- Makes karaoke from any youtube video link. The method is based on machine learning methods. After dowloading video from Youtube, the audi…☆12Aug 6, 2023Updated 2 years ago
- Create Unlimited Facebook Account with Email and Number☆10Feb 24, 2021Updated 4 years ago
- 📧 Python script to forward emails using Gmail.☆10Apr 4, 2023Updated 2 years ago
- Python Script To Scrape Instagram And Print Number Of FOLLOWERS, FOLLOWING And POSTS Of User-Input Instagram Username☆13Sep 19, 2019Updated 6 years ago
- Scrape and convert Prezi presentations to PDF☆13Mar 24, 2022Updated 3 years ago
- Create your own mock interview simulator using the power of WorqHat AI☆10May 24, 2024Updated last year
- Use LLM to generate Obsidian timeline style Cornell notes☆11May 10, 2023Updated 2 years ago
- A financial assistant using LLMs and Ntropy transaction enrichment☆10Jul 18, 2023Updated 2 years ago
- 🔩 Zender gateway controller files for different providers☆10Oct 6, 2024Updated last year
- An application that helps you summarize your meetings in real time using OpenAI's ChatGPT APIs.☆12Mar 14, 2023Updated 2 years ago
- Airflow POC demo : 1) env set up 2) airflow DAG 3) Spark/ML pipeline | #DE☆11Dec 19, 2022Updated 3 years ago
- This repository is a directory of all the projects done in the 30-day AI Internship of Pantech Solutions.☆10Nov 3, 2020Updated 5 years ago
- Various data stream/batch process demo with Apache Scala Spark 🚀☆11Feb 28, 2020Updated 5 years ago
- This project helps in monitoring the performance of your LinkedIn posts over a specified amount of time by the user and analyzes their e…☆12Apr 22, 2023Updated 2 years ago
- A basic GUI for interacting with the Stability AI API for Stable Diffusion 3☆16Apr 22, 2024Updated last year
- Humanlike AI Chat is a terminal-based LLM UI designed to study how to bypass AI text detection.☆12Mar 15, 2024Updated last year
- This is an official repository for the Article Generation app using Llama2, Pexels, and Streamlit.☆13Aug 5, 2023Updated 2 years ago
- ☆11Feb 7, 2021Updated 5 years ago
- The ultimate AI detection bypass tool.☆13Jan 22, 2024Updated 2 years ago
- A platform to publish and manage volunteering events around you.☆13Updated this week
- AI Service that classifies data with given or passed model structure response to use in code☆12Apr 25, 2023Updated 2 years ago
- OSINT tool allowing the exploration and the scrapping of a user's public data from a Google email address (gmail, googlemail) to find You…☆58Mar 25, 2024Updated last year
- A small but scalable learning management system built using Django☆13Jun 10, 2021Updated 4 years ago
- Animately is an Arduino library that allows for precise animation of props or robots, down to the millisecond, without the need for threa…☆12Feb 3, 2024Updated 2 years ago
- ::A tool to abbreviate scientific paper contents using ChatGPT::☆12Nov 20, 2023Updated 2 years ago
- Svelte app to generate audiobooks using XTTS☆12Feb 13, 2024Updated 2 years ago
- a phishing page☆14Aug 7, 2017Updated 8 years ago
- Paperwise is a systematic and streamlined research workflow for managing AI papers that revolves around three core tools: Zotero, Researc…☆14Apr 11, 2025Updated 10 months ago
- Pull list of leads from a Twitter Ads Lead Generation Card☆12Feb 7, 2017Updated 9 years ago
- This Node.js app built with MongoDB allows users to compare both scores from Yelp and Google+ for a restaurant at the same time. It uses …☆10Oct 18, 2015Updated 10 years ago
- An automation script written in Node.js, powered by Puppeteer to scrape multiple pages of Justdial (an Indian Yellow Pages website) and e…☆16Jun 17, 2024Updated last year
- Use AI to shorten and re-narrate audiobooks☆16Sep 24, 2024Updated last year
- Notion class notes/study template automated using a CLI to seamlessly integrate scientifically proven study techniques with minimal frict…☆23Apr 29, 2025Updated 9 months ago