ivbeg / newsworker
Advanced news feed extractor and finder library. Helps automatically extract news from websites without RSS/Atom feeds.
☆80Updated 3 years ago
Alternatives and similar repositories for newsworker
Users that are interested in newsworker are comparing it to the libraries listed below
- Parses Firefox/Chrome HTML bookmarks files☆49Updated last year
- API - extract a list of keywords from a text.☆18Updated 8 years ago
- Project on text topics evolution over time analysis☆81Updated 3 years ago
- ☆62Updated last year
- Simple framework for building Instagram chat bots with menu driven interface☆18Updated 5 years ago
- Extract text from HTML☆134Updated 5 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆58Updated last year
- CoCrawler is a versatile web crawler built using modern tools and concurrency.☆191Updated 3 years ago
- Aggregates posts from the telegram channels assigned to a bot (admin), saves them into the MongoDB & renders the data in form of cards (R…☆14Updated 2 years ago
- SEO python scraper to extract data from major searchengine result pages. Extract data like url, title, snippet, richsnippet and the type …☆269Updated 3 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 6 years ago
- Console program to get global ranking for a given website or domain☆21Updated 5 months ago
- Russian names parsers, gender identification and processing tools☆134Updated last year
- ☆63Updated 2 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a…☆99Updated 3 years ago
- Reproducing http://kingjamesprogramming.tumblr.com and having fun.☆44Updated 6 years ago
- Matrix-based News Aggregation to Explore Media Bias☆20Updated 7 years ago
- Lazy helper tool to make easier scraping with simple tasks☆19Updated 3 years ago
- Quick and dirty date parsing Python library to parse HTML dates really fast☆21Updated 2 years ago
- Tag news stories based on models trained on the NYT corpus.☆42Updated 2 years ago
- Aiohttp web server API, which scrapes Google and returns scrape results as response. Supports proxies, multiple geos and number of result…☆59Updated last year
- A Python Package which helps to scrape all news details from any news websites☆219Updated 4 months ago
- Firefox Web Extension to save Facebook posts as images☆22Updated 4 years ago
- Architecture for the Twint scraper which allows downloading tweets on many instances without API restrictions☆10Updated 4 years ago
- project to produce various useful scrapers☆33Updated this week
- Extract dates from text☆65Updated 4 years ago
- FeedCrunch.IO - Take RSS Feeds to the next level with personalized recommendations☆15Updated 3 years ago
- keywords-extract - Command line tool extract keywords from any web page.☆61Updated 7 years ago
- Text analysis for automatic bookmarking/keyword extraction☆18Updated 8 years ago
- The little things give you away... A collection of various small helper stuff – Mirror repo only, no longer kept in sync, refer to gitea.…☆24Updated 5 years ago