radiolarian / AO3Scraper
A Python scraper for getting fan fiction content and metadata from Archive of Our Own.
☆188Updated last year
Alternatives and similar repositories for AO3Scraper:
Users that are interested in AO3Scraper are comparing it to the libraries listed below
- Scraping tools for fanfiction.net☆59Updated 5 years ago
- A scripted Python interface to some of the data on AO3☆84Updated 2 years ago
- An unofficial archiveofourown.org (AO3) API for python☆208Updated 3 months ago
- Scripts for scraping Archive of Our Own (AO3), Tumblr, Fanfiction.net (FFN), and Wattpad to gather fandom data.☆60Updated last year
- Utility for downloading fanfiction in bulk from the Archive of Our Own☆276Updated 3 weeks ago
- A script to quickly get all public AO3 bookmark URLs from your account☆21Updated 3 years ago
- The Easy Fanfic Library project enables users to download works from AO3 and other fanfiction sites into a library in the Calibre free e-…☆16Updated last month
- Enhancements for ArchiveOfOurOwn.org☆71Updated 5 months ago
- The Organization for Transformative Works (OTW) - Archive Of Our Own (AO3) Project☆1,519Updated this week
- NodeJS API for scraping AO3 data☆25Updated last month
- An implicit recommendation system for AO3-based fanworks.☆10Updated 2 years ago
- Fandom Statistics☆15Updated 3 years ago
- An approximate nearest-neighbor search for text reuse.☆12Updated 4 years ago
- FanFicFare is a tool for making eBooks from stories on fanfiction and other web sites.☆852Updated this week
- Code for the tumblr bot nostalgebraist-autoresponder.☆70Updated last year
- A light minimal responsive theme for Archiveofourown.org☆18Updated 4 years ago
- Contains materials for a work in progress - "A Humanist's Cookbook for Natural Language Processing in Python."☆40Updated 3 years ago
- Lazy Jekyll site of transcripts of episodes of the podcast The Magnus Archives.☆32Updated last year
- Pipeline to generate the Standardized Project Gutenberg Corpus☆182Updated last year
- Data and code for the book Enumerations: Data and Literary Study (Chicago 2018)☆25Updated 6 years ago
- ☆166Updated this week
- @DHRI-Curriculum Session on the command line, a means of interacting with your computer programmatically through text.☆14Updated last year
- A Python scraper for Goodreads books and reviews.☆292Updated 2 months ago
- Official syllabus and course materials for English 184E: “Literary Text Mining” (Spring 2019)☆18Updated 4 years ago
- I wanted all of plaintext Project Gutenberg in an easy-to-use format, so I made this☆222Updated 2 years ago
- A repository of datasets paired with rich documentation, data essays, and teaching resources☆74Updated last month
- An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship ty…☆93Updated 11 months ago
- Tools for Twitter☆35Updated 3 years ago
- Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.co…☆315Updated 3 years ago
- The Digital Humanities Literacy Guidebook☆66Updated 2 years ago