datawizard1337 / ARGUSLinks
ARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of different websites. On the websites, ARGUS is able to perform tasks like scraping texts or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9
☆88Updated 3 years ago
Alternatives and similar repositories for ARGUS
Users that are interested in ARGUS are comparing it to the libraries listed below
Sorting:
- Scraper for Facebook's Archive of Ads with Political Content☆37Updated 6 years ago
- Tag news stories based on models trained on the NYT corpus.☆42Updated 2 years ago
- A Python scraper for the Facebook Ad Library, using the official Facebook Ad Library API.☆119Updated 5 years ago
- A set of jupyter notebooks demonstrating how to use the Media Cloud API.☆38Updated this week
- An automated, programming-free web scraper for interactive sites☆111Updated last year
- A helper library full of URL-related heuristics.☆69Updated 2 weeks ago
- Pre-built Scrapy spiders for AutoExtract☆19Updated last year
- Tools for auditing autocomplete on Google and Bing☆24Updated this week
- The FBAdLibrarian is a simple tool that can pull ad data and collects images offered by Facebook’s Ad Library API.☆16Updated 2 years ago
- The documentation and scripts for the Local News Dataset☆25Updated 3 years ago
- Fast, flexible name matching for large datasets☆72Updated last month
- Now included in rigour☆151Updated last month
- Public client for consuming content from the Media Cloud Online News Archive & Directory.☆76Updated this week
- Given a set of URLs, this packages detects coordinated link sharing behavior on social media and outputs the network of entities that per…☆75Updated 10 months ago
- A framework for observing Twitter through interactive networks.☆73Updated 7 months ago
- Pushshift Telegram Ingest☆86Updated 5 years ago
- Extracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence☆64Updated last year
- YTDT is a collection of simple tools for extracting data from the YouTube platform via the YouTube API v3.☆127Updated 8 months ago
- Collector for Facebook's Political Ad API☆31Updated 2 years ago
- Page Object pattern for Scrapy☆123Updated 3 weeks ago
- Google Trends, made easy.☆110Updated last year
- Example tutorials for twarc v2☆12Updated 3 years ago
- ☆24Updated last year
- A complimentary proxy to help to use SPM with headless browsers☆108Updated 2 years ago
- TikTok Content Scraper -- No API-Key needed, minimal dependencies, citable | Download videos (MP4), slides (JPEG) and metadata of author,…☆27Updated 2 weeks ago
- Parse and cluster USPTO patent data. Includes applications, grants, assignments, and maintenance.☆137Updated last year
- Political Discourse Analysis Using Pre-Trained Word Vectors.☆23Updated 2 years ago
- A classifier that distinguishes political from non-political news articles.☆30Updated last year
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆23Updated last year
- A Python Client for collect and parse public data from the Youtube Data API☆81Updated last year