thibauts / duckduckgo
Simple duckduckgo results scraping
☆67 Updated 7 years ago
Alternatives and similar repositories for duckduckgo:
Users who are interested in duckduckgo are comparing it to the libraries listed below.
- A spell-checker extending Peter Norvig's with multi-typo correction, Hamming distance weighting, and more. ☆98 Updated 4 years ago
- A Python module that automatically summarizes text documents and web pages ☆46 Updated 2 years ago
- A component that tries to avoid downloading duplicate content ☆27 Updated 6 years ago
- Find which links on a web page are pagination links ☆29 Updated 8 years ago
- Automatically extracts and normalizes an online article or blog post publication date ☆118 Updated last year
- A Python module to fetch and parse results from different search engines. ☆77 Updated 6 years ago
- Reduction is a Python script which automatically summarizes a text by extracting the sentences which are deemed to be most important. ☆55 Updated 9 years ago
- Show a summary of a large number of URLs in a Jupyter Notebook ☆17 Updated 3 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends ☆55 Updated last year
- A Python module that provides content extraction and summarization of a web page, even if the web page is broken. ☆19 Updated last year
- A Python Instagram scraper which uses BeautifulSoup and JSON to scrape public Instagram accounts ☆27 Updated 7 years ago
- A Python module to extract personality insights, sentiment & keywords from Reddit accounts. pip install reddit_persona ☆26 Updated 7 years ago
- API - extract a list of keywords from a text. ☆18 Updated 7 years ago
- Site Hound (previously THH) is a Domain Discovery Tool ☆23 Updated 3 years ago
- Python bindings to the Compact Language Detector ☆33 Updated 4 years ago
- An interface for interacting with MediaWiki ☆37 Updated 3 years ago
- Extract the difference between two HTML pages ☆32 Updated 6 years ago
- Streaming web crawler with WebSocket API ☆42 Updated last year
- Aviation grade news article metadata extraction ☆36 Updated last year
- Intelligent Web Data Extractor ☆74 Updated 2 years ago
- Detect and classify pagination links ☆15 Updated 4 years ago
- Pipeline for distributed Natural Language Processing, made in Python ☆65 Updated 8 years ago
- Virtual patent marking crawler at iproduct.epfl.ch ☆14 Updated 7 years ago
- Adaptive crawler which uses Reinforcement Learning methods ☆169 Updated 6 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy. ☆42 Updated 5 years ago
- This is a REST Server endpoint built using Flask and Python. ☆24 Updated 2 years ago
- Get user IDs from social network handles ☆12 Updated 8 years ago
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?) ☆36 Updated 10 years ago
- Paginating the web ☆37 Updated 10 years ago
- Scrapy middleware that allows crawling only new content ☆80 Updated 2 years ago