ivbeg / newsworker
Advanced news feeds extractor and finder library. Helps to automatically extract news from websites without RSS/ATOM feeds
☆79Updated 2 years ago
Alternatives and similar repositories for newsworker:
Users that are interested in newsworker are comparing it to the libraries listed below
- Lazy helper tool to make easier scraping with simple tasks☆18Updated 2 years ago
- Project on text topics evolution over time analysis☆81Updated 2 years ago
- Find rss, atom, xml, and rdf feeds on webpages☆30Updated 5 months ago
- Russian names parsers, gender identification and processing tools☆129Updated last year
- Classification and aggregation of russian news articles. University coursework.☆17Updated 6 years ago
- Универсальный парсер деклараций в формат для передачи в Декларатор.☆18Updated 4 months ago
- Comparing quality and performance of NLP systems for Russian language☆46Updated last year
- Simple framework for building Instagram chat bots with menu driven interface☆18Updated 4 years ago
- Python client for Yandex.XML☆18Updated last year
- Russian Text Expansion based on ruGPT3Large☆25Updated 2 years ago
- ☆62Updated 10 months ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆56Updated last year
- Bot for forwarding updates from RSS/Atom feeds to Telegram☆56Updated 2 months ago
- RUSSE: Russian Semantic Evaluation.☆15Updated 3 years ago
- Tag news stories based on models trained on the NYT corpus.☆42Updated 2 years ago
- A helper library full of URL-related heuristics.☆66Updated 5 months ago
- Firefox and Chrome compatible extension that acts as annotation tool for websites (Named Entity Recognition)☆10Updated 6 years ago
- A simple dictionary-based sentiment analysis system with Russian language support☆28Updated 3 years ago
- Python library to read, write and convert data files with formats BSON, JSON, NDJSON, Parquet, ORC, XLS, XLSX and XML☆16Updated 7 months ago
- Quick and dirty date parsing Python library to parse HTML dates really fast☆20Updated last year
- Inspired by word2vec-pride-vis the replacement of words of Russian most valuable novels text with closest word2vec model words. By Boris …☆48Updated 7 months ago
- The Pereval server: a set of OSINT & misc related web-services☆36Updated 4 years ago
- NLP project that works with news (NER, context generation, news trend analytics)☆43Updated 2 years ago
- Readability.io public code☆41Updated 8 years ago
- Scrape VK media☆57Updated last year
- Russian coreference resolution competition☆10Updated last year
- VK-Top is used for getting popular posts of any public available page at VK.com☆39Updated 2 years ago
- API - extract a list of keywords from a text.☆18Updated 7 years ago
- Poetry tools and russian text parser☆8Updated 8 years ago