maria-antoniak / goodreads-scraper
A Python scraper for Goodreads books and reviews.
☆291Updated last month
Alternatives and similar repositories for goodreads-scraper:
Users that are interested in goodreads-scraper are comparing it to the libraries listed below
- Scrape data from Goodreads using Scrapy and Selenium☆136Updated 10 months ago
- Download subreddit comments☆94Updated 3 years ago
- Example scripts for the pushshift dump files☆353Updated last week
- A Python wrapper around the topic modeling functions of MALLET.☆101Updated 5 months ago
- A multithread Pushshift.io API Wrapper for reddit.com comment and submission searches.☆217Updated 2 years ago
- An affect generator based on TextBlob and the NRC affect lexicon. Note that lexicon license is for research purposes only.☆70Updated 2 years ago
- Scrape news articles and analyze them using NLP to quantify the gender gap in Canadian mainstream media☆42Updated 11 months ago
- Code for the paper "Content Analysis of Textbooks via Natural Language Processing".☆58Updated last year
- An NLP processing pipeline for characters in fanfiction. Developed by students at Carnegie Mellon University from 2019-2021.☆32Updated 7 months ago
- GoodReads Best Books Ever dataset repository☆31Updated 4 years ago
- Cleans Reddit Text Data☆81Updated 5 years ago
- ☆124Updated last year
- Python client for thegaurdian api☆68Updated last year
- Public client for consuming content from the Media Cloud Online News Archive & Directory.☆72Updated 4 months ago
- Introduction to Cultural Analytics & Python, course website and online textbook powered by Jupyter Book☆267Updated last year
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo…☆106Updated 6 years ago
- Making Reddit data accessible to researchers, moderators and everyone else. Interact with the data through large dumps, an API or web in…☆371Updated last week
- Gathering dataset from Goodreads website☆45Updated 4 years ago
- Tools for Twitter☆35Updated 3 years ago
- a python package for cleaning Gutenberg books and dataset☆34Updated 2 years ago
- This is code that we will cover in my Hacking the Humanities class at Leiden University. Video tutorials will be uploaded to my YouTube c…☆31Updated 6 years ago
- Releases for the reddit-graph project☆19Updated 9 months ago
- Fuzzy matches and merging of datasets in pandas using csvmatch☆74Updated 4 years ago
- ☆14Updated 7 months ago
- Datasets of the daily Twitter output of Congress.☆109Updated last year
- Scraping tools for fanfiction.net☆59Updated 5 years ago
- A GoodReads.com Scraper script to get books reviews including text and rating.☆41Updated 2 years ago
- Data and code for the book Enumerations: Data and Literary Study (Chicago 2018)☆25Updated 6 years ago
- Measure the readability of a given text using surface characteristics☆78Updated 2 months ago
- Using the Gmail API to topic model my recommended Medium reads☆24Updated 3 years ago