Siltaar / doc_crawler.py
Explore a website recursively and download all the wanted documents (PDF, ODT…)
☆20Updated 3 years ago
Related projects:
- An eBook tool to extract ISBNs or metadata from eBooks and rename them using an ISBN database and metadata☆30Updated 9 years ago
- Scraper for categories and lists on ecommerce and other listing websites☆43Updated 3 years ago
- Python library for finding phone numbers in random user input text.☆9Updated 7 years ago
- ☕🗄CAching Proxy in Python – Simple file based python http proxy☆15Updated 3 years ago
- Access domain information via python and command line.☆16Updated 5 months ago
- Software to monitor radio frequency activity☆18Updated 6 years ago
- Python script for searching through your digital books and cataloguing them in an easy-to-share list of files.☆31Updated 4 years ago
- Restrict crawl and scraping scope using matchers.☆25Updated 8 years ago
- Get live news instantly☆65Updated 5 years ago
- Python module for Named Entity Recognition (NER) using natural language processing.☆14Updated 3 years ago
- Lightweight library that converts an HTML webpage to JSON data using a template defined in JSON.☆21Updated 3 years ago
- A component that tries to avoid downloading duplicate content☆27Updated 6 years ago
- Convert URL or RSS feed to text with readability☆49Updated 4 years ago
- Proxy-list management application for Django☆23Updated 6 years ago
- A Python library that provides unique keys for 2FA from a given secret.☆41Updated 2 years ago
- TweetSploit is a Twitter marketing suite with a simple interface from which you can access and automate Twitter marke…☆21Updated 8 years ago
- A CLI tool to clear up your email!☆30Updated 3 years ago
- 💻 Terminal-like Python input( ) function.☆19Updated 5 years ago
- A scrapy extension to store requests and responses information in storage service☆26Updated 2 years ago
- A native web-based client for Slack.☆23Updated 7 years ago
- A Python client for Chrome's DevTools protocol / a headless chrome control library☆15Updated 6 years ago
- Passive network observation tool☆31Updated 5 years ago
- A Slack client written in Python with Urwid☆28Updated 11 months ago
- Python library for modern thread / multiprocessing pooling and task processing via asyncio☆15Updated 3 years ago
- Turn your IPython console into a cross-database SQL client☆31Updated 8 years ago
- Perform lexical analysis on words, one word at a time.☆64Updated 6 years ago
- 🕷Configuration based html scraper☆22Updated 3 months ago