motoom / gutenberg-ebook-scrapingLinks
Download, convert and organize Gutenberg books for eBook Readers
☆46Updated 6 years ago
Alternatives and similar repositories for gutenberg-ebook-scraping
Users that are interested in gutenberg-ebook-scraping are comparing it to the libraries listed below
Sorting:
- An eBook tool to extract ISBN or Metadata form eBook and rename them by using ISBN database and Metadata☆30Updated 10 years ago
- An online sentiment analyzer built with Flask and TextBlob☆15Updated 11 years ago
- Google Books Downloader / Image Scraper☆53Updated 6 years ago
- A Python library that provides an api to search and get links from Books,Magazines,Comics,... from Library Genesis.☆121Updated 3 years ago
- Take streaming tweets, extract hashtags & usernames, create graph, export graphml for Gephi visualisation☆38Updated 12 years ago
- Analyzer and statistics generator for text-based conversations. Includes Facebook scraper and parser☆74Updated 6 years ago
- Archive.org OPDS Bookserver - A standard for digital book distribution☆130Updated 6 years ago
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy.☆47Updated 7 years ago
- Intelligent RSS news aggregator.☆33Updated last year
- Demo of the Newspaper article extraction library.☆29Updated 10 years ago
- A component based data flow framework with a drag-n-drop Web 2.0 interface. Based on Stackless Python and inspired by Yahoo! Pipes.☆150Updated 12 years ago
- Scrapy project with spiders to extract article content from various german news sites☆21Updated 11 years ago
- scraper related helper functions☆27Updated 11 years ago
- Scraper for downloading the entire ebooks repository of project Gutenberg☆151Updated 3 weeks ago
- A Python utility for moving bookmarks/reading lists between services☆204Updated 9 years ago
- A python script to download books from libgen.io☆75Updated 6 years ago
- A Python script that parses post titles, self-texts, and comments on reddit and makes word clouds out of the word frequencies.☆291Updated 2 years ago
- Pypo is a self hosted bookmarking service like Pocket, implemented in Python with django☆29Updated 9 years ago
- An interactive map of Stack Exchange tags for all sites.☆126Updated last year
- The reddit Data Extractor is a cross-platform GUI tool for downloading almost any content posted to reddit. Downloads from specific users…☆238Updated 8 months ago
- Cocktail recipe search written in Python with werkzeug, scrapy and sphinx☆99Updated 7 years ago
- A GoodReads.com Scraper script to get books reviews including text and rating.☆44Updated 3 years ago
- Scrapy middleware which allows to crawl only new content☆80Updated 2 years ago
- API server for NLTK☆23Updated 8 years ago
- A javascript tool to visualize the diff's in wikipedia☆35Updated 2 years ago
- Aviation grade news article metadata extraction☆36Updated 2 years ago
- This is the NewsFinder software, designed to automatically crawl the web for news related to artificial intelligence, filter, categorize,…☆62Updated 11 years ago
- Lightweight, multilingual natural language processing☆63Updated 12 years ago
- Download *ALL* the submissions from Hacker News☆50Updated 11 years ago
- Personal Knowledge Management System. Capture your ideas using plain old text files. Make a journal that lasts 100 years.☆29Updated last year