ivbeg / newsworkerLinks
Advanced news feeds extractor and finder library. Helps to automatically extract news from websites without RSS/ATOM feeds
☆80Updated 2 years ago
Alternatives and similar repositories for newsworker
Users that are interested in newsworker are comparing it to the libraries listed below
Sorting:
- Quick and dirty date parsing Python library to parse HTML dates really fast☆21Updated last year
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆57Updated last year
- Simple framework for building Instagram chat bots with menu driven interface☆18Updated 5 years ago
- ☆62Updated last year
- Parses Firefox/Chrome HTML bookmarks files☆50Updated last year
- CoCrawler is a versatile web crawler built using modern tools and concurrency.☆190Updated 3 years ago
- Extract text from HTML☆134Updated 5 years ago
- API - extract a list of keywords from a text.☆18Updated 8 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a…☆99Updated 2 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 6 years ago
- RSS feed reader for Python 3☆88Updated 2 years ago
- Extracts tables from .docx files and saves them as .csv or .xls files☆64Updated last year
- Python client for Yandex.XML☆19Updated 2 years ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- Russian names parsers, gender identification and processing tools☆132Updated last year
- Tag news stories based on models trained on the NYT corpus.☆42Updated 2 years ago
- Find rss, atom, xml, and rdf feeds on webpages☆30Updated 10 months ago
- Lazy helper tool to make easier scraping with simple tasks☆19Updated 2 years ago
- Easy extraction of keywords and engines from search engine results pages (SERPs).☆90Updated 3 years ago
- Lightweight library that converts a HTML webpage to JSON data using a template defined in JSON.☆23Updated 2 months ago
- Automatically extracts and normalizes an online article or blog post publication date☆117Updated 2 years ago
- Firefox Web Extension to save Facebook posts as images☆21Updated 4 years ago
- Atom, RSS and JSON feed parser for Python 3☆117Updated 2 years ago
- DuckDuckGo search engine API library for Python☆41Updated 5 years ago
- Pinterest API for Python☆33Updated 8 years ago
- Aiohttp web server API, which scrapes Google and returns scrape results as response. Supports proxies, multiple geos and number of result…☆57Updated last year
- SEO python scraper to extract data from major searchengine result pages. Extract data like url, title, snippet, richsnippet and the type …☆268Updated 3 years ago
- Ultimate Website Sitemap Parser☆225Updated 2 weeks ago
- Scrapes sites. Gets news. Eventually events.☆87Updated 9 years ago
- Простая обертка на языке Python для яндексового Tomita Parser'а (больше не нужна, Яндекс открыл исходники)☆17Updated 9 years ago