juanluisrto / Scraping-orchestra
A scraping Master-slave system based on Google App Engine
☆11Updated 4 years ago
Alternatives and similar repositories for Scraping-orchestra:
Users who are interested in Scraping-orchestra are comparing it to the libraries listed below.
- A financial disclosure data extraction tool.☆13Updated last year
- This repository explores various Numpy commands which are quite useful for working with datasets and handling array operations.☆13Updated 6 years ago
- Find rss, atom, xml, and rdf feeds on webpages☆30Updated 4 months ago
- an app that makes your personalized newsletter based on your bookmarks☆11Updated 7 years ago
- This project is a wrapper for Leilex, a legal entity identifier API. Includes ISIN-LEI conversion. Search for an LEI number using a company name.☆24Updated 5 months ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆56Updated 2 months ago
- ☆13Updated 5 years ago
- Virtual patent marking crawler at iproduct.epfl.ch☆14Updated 7 years ago
- My dot files in one place - extensively edited over time. Your mileage may vary☆2Updated 8 years ago
- scraper for facebook, gab, google and tiktok☆22Updated 8 months ago
- A utility that searches for RSS feeds from a CSV list of URLs☆11Updated 4 years ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.☆13Updated last week
- A Flask webapp that categorizes Outlook emails using machine learning☆15Updated 9 years ago
- A maximum-strength name parser for record linkage.☆36Updated last month
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values …☆15Updated 6 years ago
- ☆12Updated 5 years ago
- ☆10Updated 3 years ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆32Updated 2 years ago
- A small Python app to create Notion pages from Jira issues☆17Updated 2 years ago
- A Python 3 module that converts your bs4 Tag into a JSON object (dict)☆14Updated 11 months ago
- A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.☆13Updated 5 years ago
- ☆12Updated last year
- Scrape various open data directories to create an index of what's available out there☆36Updated 3 weeks ago
- A set of tools to accelerate work in Jupyter notebooks.☆11Updated 5 years ago
- Simple job postings scraper for Indeed based on requests and BeautifulSoup☆14Updated 3 years ago
- GraphiPy: Universal Social Data Extractor☆81Updated 2 years ago
- Open + free make / model / year / style database☆14Updated 2 weeks ago
- Linkibot, easy to use, a Linkedin invite bot for expanding your network☆33Updated last year
- Functional composable pipelines allowing clean separation of the business logic and its implementation☆11Updated 9 months ago
- Parse government documents into well formed JSON☆68Updated 3 weeks ago