motoom / gutenberg-ebook-scraping
Download, convert and organize Gutenberg books for eBook Readers
☆46Updated 5 years ago
Alternatives and similar repositories for gutenberg-ebook-scraping
Users that are interested in gutenberg-ebook-scraping are comparing it to the libraries listed below
Sorting:
- Presentations on Quantified Self and Self-Tracking with Python☆30Updated 2 years ago
- Take streaming tweets, extract hashtags & usernames, create graph, export graphml for Gephi visualisation☆38Updated 11 years ago
- Google Books Downloader / Image Scraper☆53Updated 6 years ago
- A GoodReads.com Scraper script to get books reviews including text and rating.☆43Updated 2 years ago
- An online sentiment analyzer built with Flask and TextBlob☆15Updated 11 years ago
- Demo of the Newspaper article extraction library.☆29Updated 10 years ago
- An eBook tool to extract ISBN or Metadata form eBook and rename them by using ISBN database and Metadata☆30Updated 9 years ago
- Python code to scrape and collect data from the RSS feeds Facebook uses to augment its Trending Section☆57Updated 6 years ago
- An online annotation platform for teaching and learning in the humanities.☆108Updated 3 months ago
- Scrapy project with spiders to extract article content from various german news sites☆21Updated 11 years ago
- 100k+ topic labeled news articles published from thousands of news websites☆19Updated 4 years ago
- Python script that periodically probes the Craigslist RSS feeds for new listings.☆39Updated 13 years ago
- Pipeline for distributed Natural Language Processing, made in Python☆64Updated 8 years ago
- A simple audio file transcriber that uses the Google Cloud Speech API for transcription.☆26Updated 6 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆56Updated last year
- A Python library that provides an api to search and get links from Books,Magazines,Comics,... from Library Genesis.☆121Updated 2 years ago
- A recommender system for GitHub repositories☆14Updated 10 years ago
- Save a bunch of web pages as a self-contained, compressed archive file for offline storage and sharing.☆35Updated 12 years ago
- Short script for removing watermarks from PDF files. Requires pdftk.☆58Updated 6 years ago
- Find rss, atom, xml, and rdf feeds on webpages☆30Updated 7 months ago
- Automatically tag pinboard bookmarks based on page text☆8Updated 9 years ago
- Scraping Assisted by Learning☆35Updated last month
- Source real estate prices from the Common Crawl.☆27Updated 6 years ago
- A small command-line utility that allows you to download closed captions from YouTube as a SRT file.☆30Updated 9 years ago
- Serapis is a sentence identifier and modeling pipeline / built for Wordnik☆24Updated 8 years ago
- A python autocompletion library. Easycomplete has a simple API and utilizes google's autocomplete results & the english dictionary for no…☆40Updated 11 years ago
- Simple PHP script that parses a public amazon whishlist and export the items to a CSV file.☆13Updated 8 years ago
- Scripts to auto-OCR PDFs, translate output using publicly-available or DIY NLP translation models, and generate epub/PDF☆43Updated last year
- Random fun with statistical language models.☆65Updated 5 years ago
- A python script to download books from libgen.io☆75Updated 6 years ago