ivbeg / newsworkerLinks
Advanced news feeds extractor and finder library. Helps to automatically extract news from websites without RSS/ATOM feeds
☆80Updated 2 years ago
Alternatives and similar repositories for newsworker
Users that are interested in newsworker are comparing it to the libraries listed below
Sorting:
- Parses Firefox/Chrome HTML bookmarks files☆50Updated last year
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- ☆62Updated last year
- CoCrawler is a versatile web crawler built using modern tools and concurrency.☆190Updated 3 years ago
- API - extract a list of keywords from a text.☆18Updated 8 years ago
- Lazy helper tool to make easier scraping with simple tasks☆19Updated 2 years ago
- Extract text from HTML☆134Updated 5 years ago
- A helper library full of URL-related heuristics.☆70Updated 2 weeks ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆57Updated last year
- Extracts tables from .docx files and saves them as .csv or .xls files☆64Updated last year
- Find rss, atom, xml, and rdf feeds on webpages☆30Updated 11 months ago
- Quick and dirty date parsing Python library to parse HTML dates really fast☆21Updated last year
- A Python Package which helps to scrape all news details from any news websites☆217Updated 3 months ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 6 years ago
- Read It Later for Telegram☆83Updated 7 years ago
- Simple framework for building Instagram chat bots with menu driven interface☆18Updated 5 years ago
- Firefox and Chrome compatible extension that acts as annotation tool for websites (Named Entity Recognition)☆10Updated 6 years ago
- Python library to read, write and convert data files with formats BSON, JSON, NDJSON, Parquet, ORC, XLS, XLSX and XML☆16Updated 2 months ago
- An easy-to-use python client for Google News feeds.☆50Updated 3 years ago
- Console program to get global ranking for a given website or domain☆21Updated 3 months ago
- Scrapes sites. Gets news. Eventually events.☆87Updated 9 years ago
- Matrix-based News Aggregation to Explore Media Bias☆20Updated 7 years ago
- FBLYZE is a Facebook scraping system and analysis system.☆65Updated 4 years ago
- Ultimate Website Sitemap Parser☆227Updated last week
- Простая обертка на языке Python для яндексового Tomita Parser'а (больше не нужна, Яндекс открыл исходники)☆17Updated 9 years ago
- Project management system for publishers, magazines and content creators 🗓️⏱️✍🏼☆30Updated 4 months ago
- The little things give you away... A collection of various small helper stuff – Mirror repo only, no longer kept in sync, refer to gitea.…☆25Updated 5 years ago
- A framework to manage, monitor and deploy marketing in social-media by re-posting content from one place to the another.☆36Updated 2 years ago
- Python wrapper for Ferret☆43Updated 3 years ago
- A Python tool for downloading videos from vk.com☆21Updated 5 years ago