svven / summary
Summary is a complete solution to extract the title, image and description from any URL.
☆18 Updated last year
Alternatives and similar repositories for summary:
Users who are interested in summary are comparing it to the libraries listed below.
- Python package to detect and return RSS / Atom feeds for a given website. The tool supports major blogging platforms including WordPress, … ☆21 Updated 3 years ago
- Pure Python script that takes a user query and summarizes news related to it. ☆25 Updated 2 years ago
- This application demonstrates how to use PostgreSQL as a full-text search engine. ☆63 Updated 6 years ago
- Scraping tweets quickly using Celery, RabbitMQ and a Docker cluster ☆48 Updated 2 years ago
- Scrapy middleware for autologin ☆37 Updated 6 years ago
- A Python library for extracting titles, images, descriptions and canonical URLs from HTML. ☆149 Updated 4 years ago
- Spam filtering made easy for you ☆142 Updated 5 years ago
- Restrict crawl and scraping scope using matchers. ☆25 Updated 8 years ago
- Easy extraction of keywords and engines from search engine results pages (SERPs). ☆90 Updated 3 years ago
- 📑⚙️ Python/Django reference implementation of the ERAV data model ☆21 Updated 5 years ago
- Simple Web UI for Scrapy spider management via Scrapyd ☆51 Updated 6 years ago
- Python library for interacting with the Etsy API ☆15 Updated 3 years ago
- API to extract a list of keywords from a text. ☆18 Updated 7 years ago
- A Django-based search engine powered by CouchDB, Celery and Whoosh. ☆49 Updated 9 years ago
- Python library with common functionality for writing web scrapers ☆102 Updated 9 years ago
- A project to attempt to automatically log in to a website given a single seed ☆123 Updated 2 years ago
- Python module to watch Twitter user pages or search results. ☆62 Updated 10 years ago
- Automated Search Engine Optimization Testing Tool ☆82 Updated 6 years ago
- Python code to scrape and collect data from the RSS feeds Facebook uses to augment its Trending Section ☆57 Updated 6 years ago
- Console program to get the global ranking for a given website or domain ☆21 Updated 2 years ago
- Scrapy middleware that allows crawling only new content ☆80 Updated 2 years ago
- PyQuery-based scraping micro-framework. ☆116 Updated 3 years ago
- Site Hound (previously THH) is a Domain Discovery Tool ☆23 Updated 3 years ago
- Bringing sanity to the world of messed-up data ☆66 Updated 10 years ago
- Django Boilerplate Template for SaaS applications ☆46 Updated 11 months ago
- Python class for use with Django to detect Disposable Emails ☆53 Updated last year
- Extensions for using Scrapy on Amazon AWS ☆32 Updated 12 years ago
- Exporters is an extensible export pipeline library that supports filtering, transformation, and several sources and destinations ☆40 Updated 10 months ago
- ☆49 Updated 2 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends ☆56 Updated last year