ivbeg / newsworkerLinks
Advanced news feeds extractor and finder library. Helps to automatically extract news from websites without RSS/ATOM feeds
☆81Updated 2 months ago
Alternatives and similar repositories for newsworker
Users that are interested in newsworker are comparing it to the libraries listed below
Sorting:
- ☆62Updated last year
- API - extract a list of keywords from a text.☆18Updated 8 years ago
- Parses Firefox/Chrome HTML bookmarks files☆48Updated last year
- Extract text from HTML☆134Updated this week
- CoCrawler is a versatile web crawler built using modern tools and concurrency.☆193Updated 3 years ago
- Quick and dirty date parsing Python library to parse HTML dates really fast☆21Updated 3 weeks ago
- Simple framework for building Instagram chat bots with menu driven interface☆18Updated 5 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆58Updated last year
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆34Updated 2 years ago
- A helper library full of URL-related heuristics.☆73Updated 4 months ago
- Find rss, atom, xml, and rdf feeds on webpages☆31Updated 2 months ago
- Automatically extracts and normalizes an online article or blog post publication date☆118Updated 2 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 6 years ago
- This repository provides usage examples for the Python module Newspaper3k.☆150Updated 2 years ago
- Extracts tables from .docx files and saves them as .csv or .xls files☆65Updated 2 years ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆144Updated 2 months ago
- Ultimate Website Sitemap Parser☆242Updated this week
- Lazy helper tool to make easier scraping with simple tasks☆19Updated 3 years ago
- Простая обертка на языке Python для яндексового Tomita Parser'а (больше не нужна, Яндекс открыл исходники)☆17Updated 10 years ago
- Python wrapper for Ferret☆45Updated 4 years ago
- Tag news stories based on models trained on the NYT corpus.☆42Updated 2 years ago
- Firefox and Chrome compatible extension that acts as annotation tool for websites (Named Entity Recognition)☆10Updated 6 years ago
- Matrix-based News Aggregation to Explore Media Bias☆20Updated 7 years ago
- keywords-extract - Command line tool extract keywords from any web page.☆62Updated 7 years ago
- Python client for Yandex.XML☆19Updated 2 years ago
- python module to access the telegram bot api.☆66Updated last year
- A Python Package which helps to scrape all news details from any news websites☆223Updated 7 months ago
- FBLYZE is a Facebook scraping system and analysis system.☆67Updated 4 years ago
- Transfer video recordings from the Zoom to YouTube☆79Updated 2 years ago
- Awesome list of the software tools related to opendata: data catalogs, ingestion tools, data prep tools and so on☆35Updated 2 months ago