maria-antoniak/goodreads-scraper

A Python scraper for Goodreads books and reviews.

☆306

Alternatives and similar repositories for goodreads-scraper

Users who are interested in goodreads-scraper are comparing it to the repositories listed below.

quadrismegistus / lltk
Literary Language Toolkit: code, models, corpora, and web tools
☆11 · Jul 5, 2026 · Updated 2 weeks ago
organisciak / htrc-book-models
Within-book topic modeling on HTRC feature extraction files
☆24 · May 3, 2016 · Updated 10 years ago
melaniewalsh / Intro-Cultural-Analytics
Introduction to Cultural Analytics & Python, course website and online textbook powered by Jupyter Book
☆280 · Mar 31, 2026 · Updated 3 months ago
matinho13 / SentiArt
A simple vector space model based tool for sentiment analysis of literary texts
☆19 · Sep 17, 2024 · Updated last year
tedunderwood / noveltmmeta
Code and data supporting "NovelTM Data Sets for English-Language Fiction."
☆26 · Dec 22, 2020 · Updated 5 years ago
javierlopeza / goodreads-scraper
Python script to scrap www.goodreads.com books shelves.
☆14 · Jan 19, 2024 · Updated 2 years ago
laurejt / authorless-tms
Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"
☆29 · May 13, 2020 · Updated 6 years ago
jmhessel / FightingWords
Quick implementation of Monroe et al.'s algorithm for comparing languages
☆57 · Jun 15, 2020 · Updated 6 years ago
BahramJannesar / GoodreadsBookDataset
Gathering dataset from Goodreads website
☆51 · Jan 21, 2021 · Updated 5 years ago
rlmv / hathitrust-api
Python wrappers for the HathiTrust APIs.
☆15 · Mar 16, 2019 · Updated 7 years ago
melaniewalsh / responsible-datasets-in-context
A repository of datasets paired with rich documentation, data essays, and teaching resources
☆87 · Apr 21, 2026 · Updated 3 months ago
vierth / humanitiesTutorial
This is code that we will cover in my Hacking the Humanities class at Leiden University. Video tutorials will be uploaded to my YouTube c…
☆33 · Oct 26, 2018 · Updated 7 years ago
dirkhovy / NLPclass
☆39 · Jul 22, 2021 · Updated 5 years ago
soniajoseph / goodreads-quotes
Web scraper for Goodreads quotes 📚
☆23 · Mar 1, 2022 · Updated 4 years ago
ccgilroy / word-embeddings-workshop
CSS workshop on word embeddings for the social sciences, 3/19/21
☆12 · Mar 19, 2021 · Updated 5 years ago
sreecharansankaranarayanan / FormCrawl
Python package to crawl the publicly available forms filed with the Securities and Exchange Commission (SEC) under the new Electronic Dat…
☆16 · Aug 2, 2013 · Updated 12 years ago
EduNLP / textbook-analysis
Code for the paper "Content Analysis of Textbooks via Natural Language Processing".
☆62 · Jul 6, 2023 · Updated 3 years ago
Fitzy1293 / reddit
Random programs for reddit
☆17 · Feb 20, 2020 · Updated 6 years ago
rossgoodwin / sonnetizer
Generates rhyming sonnets in (mostly) iambic pentameter from any text corpus
☆55 · Aug 19, 2014 · Updated 11 years ago
htrc / htrc-feature-reader
Tools for working with HTRC Feature Extraction files
☆44 · Jul 8, 2025 · Updated last year
senderle / fandom-search
An approximate nearest-neighbor search for text reuse.
☆12 · Oct 5, 2020 · Updated 5 years ago
LeonardoEmili / Wikifier
Pytorch implementation of a BiLSTM model for the Wikification project.
☆18 · Mar 30, 2020 · Updated 6 years ago
karlicoss / ghexport
Export your Github activity: events, repositories, stars, etc.
☆58 · Updated this week
vierth / ppdh
Practical Python for DH
☆12 · Apr 6, 2020 · Updated 6 years ago
machow / tidytuesday-py
☆34 · Apr 28, 2020 · Updated 6 years ago
simonwiles / palladio-webcomponents
Web components for rendering visualizations created with the Palladio app.
☆18 · Jan 7, 2023 · Updated 3 years ago
DocNow / twarc-csv
A plugin for twarc2 for converting tweet JSON into DataFrames and exporting to CSV.
☆31 · Jul 6, 2023 · Updated 3 years ago
distant-viewing / dvt
Distant Viewing Toolkit for the Analysis of Visual Culture
☆114 · Jul 17, 2026 · Updated last week
SupervisedStylometry / SuperStyl
Supervised Stylometry
☆27 · Mar 4, 2026 · Updated 4 months ago
hepplerj / whatisdigitalhumanities
Code repository for whatisdigitalhumanities.com
☆33 · Mar 1, 2026 · Updated 4 months ago
quadrismegistus / literarytextmining
Official syllabus and course materials for English 184E: “Literary Text Mining” (Spring 2019)
☆18 · Jul 15, 2020 · Updated 6 years ago
NYPL / libbib
An R package providing WorldCat API communication, functions for validating and normalizing bibliographic codes, translation from call nu…
☆27 · Mar 20, 2026 · Updated 4 months ago
emory-courses / data-science
Practical Approaches to Data Science with Text
☆26 · May 8, 2019 · Updated 7 years ago
simonw / tweet-images
Send tweets with images from the command line
☆19 · Apr 18, 2022 · Updated 4 years ago
earlynovels / end-dataset
Early Novels Database dataset
☆16 · Jan 15, 2019 · Updated 7 years ago
derekgreene / topicscan
TopicScan: Visualization and validation interface for NMF Topic Modeling
☆23 · Jul 23, 2020 · Updated 6 years ago
kolchinski / reddit-sarc
User modeling for sarcasm detection on Reddit corpus from Khodak et al. Published in EMNLP 2018.
☆10 · Aug 25, 2018 · Updated 7 years ago
tareknaous / visual-clustering
Visual Clustering: Clustering Plotted Data by Image Segmentation
☆25 · Feb 25, 2025 · Updated last year
cohure / CoHuRe
☆27 · Feb 2, 2021 · Updated 5 years ago