ivbeg / newsworker
Advanced news feeds extractor and finder library. Helps to automatically extract news from websites without RSS/ATOM feeds
☆80 Updated 2 years ago
Alternatives and similar repositories for newsworker
Users that are interested in newsworker are comparing it to the libraries listed below
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends ☆57 Updated last year
- CoCrawler is a versatile web crawler built using modern tools and concurrency. ☆191 Updated 3 years ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t… ☆33 Updated 2 years ago
- A helper library full of URL-related heuristics. ☆70 Updated last month
- Aiohttp web server API, which scrapes Google and returns scrape results as response. Supports proxies, multiple geos and number of result… ☆57 Updated last year
- Tag news stories based on models trained on the NYT corpus. ☆42 Updated 2 years ago
- API - extract a list of keywords from a text. ☆18 Updated 8 years ago
- Find rss, atom, xml, and rdf feeds on webpages ☆30 Updated 9 months ago
- ☆62 Updated last year
- A library to extract a publication date from a web page, along with a measure of the accuracy. ☆41 Updated 5 years ago
- Extract text from HTML ☆134 Updated 4 years ago
- Console program to get global ranking for a given website or domain ☆21 Updated last month
- This Python code scrapes Google search results then applies sentiment analysis, generates text summaries, and ranks keywords. ☆28 Updated 4 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a… ☆99 Updated 2 years ago
- Simple framework for building Instagram chat bots with menu driven interface ☆18 Updated 5 years ago
- Parses Firefox/Chrome HTML bookmarks files ☆49 Updated last year
- Utilities & scripts to collect and find insight from social network data and users. ☆25 Updated last month
- Simple summarize ML model ☆16 Updated 6 years ago
- Scrape and parse Google search results in Python ☆31 Updated 2 years ago
- ARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of diff… ☆88 Updated 3 years ago
- RUSSE: Russian Semantic Evaluation. ☆15 Updated 3 years ago
- Quick and dirty date parsing Python library to parse HTML dates really fast ☆21 Updated last year
- A Python Package which helps to scrape all news details from any news websites ☆211 Updated last month
- Grabbing all news. ☆62 Updated 5 years ago
- DuckDuckGo search engine API library for Python ☆41 Updated 5 years ago
- This repository provides usage examples for the Python module Newspaper3k. ☆147 Updated last year
- Firefox and Chrome compatible extension that acts as annotation tool for websites (Named Entity Recognition) ☆10 Updated 6 years ago
- AI based web-wrapper for web-content-extraction ☆100 Updated 2 years ago
- Scrape data from Google.com, Bing.com, Baidu.com, Ask.com, Yahoo.com, Yandex.com ☆56 Updated 3 years ago
- Extracts tables from .docx files and saves them as .csv or .xls files ☆64 Updated last year