thibauts / duckduckgo
Simple duckduckgo results scraping
☆67Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for duckduckgo
- Streaming web crawler with WebSocket API☆44Updated last year
- Algorithms for URL Classification☆19Updated 9 years ago
- Spell correct entire sentences using nltk freqdist and symspell☆19Updated 7 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆57Updated 9 months ago
- ☆22Updated 9 years ago
- Intelligent Web Data Extractor☆75Updated last year
- 🔍 Google Search unofficial API for Python with no external dependencies☆217Updated 3 years ago
- Reduction is a python script which automatically summarizes a text by extracting the sentences which are deemed to be most important.☆54Updated 9 years ago
- Get user ids from social network handlers☆12Updated 7 years ago
- A component that tries to avoid downloading duplicate content☆27Updated 6 years ago
- ☆59Updated 3 years ago
- Pure python script that takes user query and summarizes news related to it.☆25Updated 2 years ago
- Find which links on a web page are pagination links☆29Updated 7 years ago
- gzipstream allows Python to process multi-part gzip files from a streaming source☆23Updated 7 years ago
- Pipeline for distributed Natural Language Processing, made in Python☆65Updated 7 years ago
- Show summary of a large number of URLs in a Jupyter Notebook☆17Updated 3 years ago
- A spell-checker extending Peter Norvig's with multi-typo correction, hamming distance weighting, and more.☆97Updated 4 years ago
- A Python module to extract personality insights, sentiment & keywords from reddit accounts. pip install reddit_persona☆25Updated 7 years ago
- A python module provides content extraction and summarization of a web page even if the web page was broken.☆19Updated last year
- This is a complete profile scraper that returns a JSON file.☆52Updated 7 years ago
- Site Hound (previously THH) is a Domain Discovery Tool☆23Updated 3 years ago
- TWINT Flask-Celery Server. Optimized tweets scraping☆13Updated 5 years ago
- A generic crawler☆78Updated 6 years ago
- An easy-to-use python client for Google News feeds.☆50Updated 2 years ago
- OSoMe API mashups☆11Updated 5 years ago
- Scrapes sites. Gets news. Eventually events.☆82Updated 8 years ago
- Virtual patent marking crawler at iproduct.epfl.ch☆14Updated 7 years ago
- Train a neural network optimized for generating Reddit subreddit posts☆28Updated 6 years ago
- Automates the process of repeatedly searching for a website via scraped proxy IP and search keywords☆42Updated last year
- Tools to manipulate and extract data from wikipedia dumps☆45Updated 11 years ago