ivbeg / newsworker
Advanced news feeds extractor and finder library. Helps to automatically extract news from websites without RSS/ATOM feeds
☆80 Updated 3 years ago
Alternatives and similar repositories for newsworker
Users that are interested in newsworker are comparing it to the libraries listed below
- Simple framework for building Instagram chat bots with a menu-driven interface ☆18 Updated 5 years ago
- CoCrawler is a versatile web crawler built using modern tools and concurrency. ☆191 Updated 3 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends ☆57 Updated last year
- ☆62 Updated last year
- Parses Firefox/Chrome HTML bookmarks files ☆49 Updated last year
- A helper library full of URL-related heuristics. ☆73 Updated last month
- Extract text from HTML ☆134 Updated 5 years ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t… ☆33 Updated 2 years ago
- API - extract a list of keywords from a text. ☆18 Updated 8 years ago
- Firefox and Chrome compatible extension that acts as annotation tool for websites (Named Entity Recognition) ☆10 Updated 6 years ago
- Project on analyzing the evolution of text topics over time ☆81 Updated 3 years ago
- Lightweight library that converts an HTML webpage to JSON data using a template defined in JSON. ☆23 Updated 5 months ago
- Ultimate Website Sitemap Parser ☆231 Updated 2 weeks ago
- A library to extract a publication date from a web page, along with a measure of the accuracy. ☆41 Updated 6 years ago
- Extract the difference between two HTML pages ☆32 Updated 7 years ago
- Aiohttp web server API, which scrapes Google and returns scrape results as response. Supports proxies, multiple geos and number of result… ☆59 Updated last year
- A Python package that helps scrape all news details from any news website ☆219 Updated 5 months ago
- Project to produce various useful scrapers ☆33 Updated last week
- Document Search Engine Tool ☆74 Updated 2 years ago
- Fast and robust date extraction from web pages, with Python or on the command-line ☆142 Updated 2 weeks ago
- Quick and dirty date parsing Python library to parse HTML dates really fast ☆21 Updated 2 years ago
- Algorithms for similar image search/reverse image search ☆36 Updated 2 years ago
- This repository provides usage examples for the Python module Newspaper3k. ☆148 Updated last year
- undatum: a command-line tool for data processing. Brings CSV simplicity to NDJSON, BSON, XML and other data files ☆48 Updated 3 months ago
- Google rank checker for real time bulk checking SEO keywords ☆33 Updated 2 years ago
- keywords-extract - Command-line tool to extract keywords from any web page. ☆61 Updated 7 years ago
- Extracts tables from .docx files and saves them as .csv or .xls files ☆65 Updated 2 years ago
- Easy extraction of keywords and engines from search engine results pages (SERPs). ☆92 Updated last month
- Crawler and scraper of the public directory of companies on LinkedIn. ☆25 Updated 6 years ago
- Pinterest API for Python ☆33 Updated 8 years ago