Siltaar / doc_crawler.pyLinks
Explore a website recursively and download all the wanted documents (PDF, ODT…)
☆20Updated 4 years ago
Alternatives and similar repositories for doc_crawler.py
Users that are interested in doc_crawler.py are comparing it to the libraries listed below
Sorting:
- scrapin' proxies with ocr☆19Updated 7 years ago
- Tools that will make writing tests, bots and scrapers using Selenium much easier☆139Updated last year
- Scraper for categories and lists on ecommerce and other listing websites☆43Updated 5 years ago
- A Python client for Chrome's DevTools protocol / a headless chrome control library☆15Updated 7 years ago
- Python library to print tables in Terminal.☆53Updated this week
- a tkinter form-based GUI to run python scripts☆25Updated 6 years ago
- An eBook tool to extract ISBN or Metadata form eBook and rename them by using ISBN database and Metadata☆29Updated 10 years ago
- RSS feed reader for Python 3☆88Updated 3 years ago
- Extract Social Profiles using Email Addresses (Python)☆14Updated 7 years ago
- a command-line web scraping tool☆151Updated 2 years ago
- Easy Regular Expressions for Python☆24Updated 3 years ago
- A Package/API/Command Line application to search lyrics from different web sources☆28Updated 3 years ago
- Python library for modern thread / multiprocessing pooling and task processing via asyncio☆15Updated 5 years ago
- A Python script that generates a list of pairs of funny words for naming things such as app releases, internal projects, servers and chil…☆26Updated 9 years ago
- AnyAPI is a library that helps you to write any API wrapper with ease and in pythonic way.☆131Updated 4 years ago
- 🕷Configuration based html scraper☆23Updated 3 months ago
- Get live news instantly☆66Updated 5 months ago
- A simple python tool that generates a requests/bs4 based web scraper☆26Updated 3 years ago
- A python script to download books from libgen.io☆76Updated 6 years ago
- A pure Python GUI app for GPG functionality and peer-to-peer encrypted messaging over Tor☆71Updated 4 years ago
- Upload any image, and the app will tell you the object in the image and translate it to any language you want (read out aloud)☆42Updated 8 years ago
- Search in multiple torrent sites from your CLI☆71Updated 2 years ago
- Easy Html Parser is an AST generator for html/xml documents. You can easily delete/insert/extract tags in html/xml documents as well as l…☆52Updated 6 years ago
- Python module for Named Entity Recognition (NER) using natural language processing.☆13Updated 4 years ago
- TweetSploit - Is a twitter Marketing Suite allowing for a nice and simple interface, from which you can access and automate Twitter marke…☆21Updated 9 years ago
- Software to monitor radio frequency activity☆18Updated 7 years ago
- Convert URL or RSS feed to text with readability☆51Updated 6 years ago
- ☕🗄CAching Proxy in Python – Simple file based python http proxy☆15Updated 2 months ago
- A minimalistic news aggregator built with Flask and powered by News API.☆77Updated 4 months ago
- 📻 Play your favorite radio station from the terminal☆78Updated 5 years ago