A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books for computational text analysis.
☆120Sep 8, 2018Updated 7 years ago
Alternatives and similar repositories for chapterize
Users that are interested in chapterize are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Project to digitize avant-garde periodicals☆12May 13, 2022Updated 4 years ago
- Early Novels Database dataset☆16Jan 15, 2019Updated 7 years ago
- A few scripts written during a system migration that use PyMARC☆10Jan 30, 2020Updated 6 years ago
- Practical Approaches to Data Science with Text☆40Dec 6, 2019Updated 6 years ago
- A simple interface to the Project Gutenberg corpus.☆333Jan 12, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆199May 3, 2024Updated 2 years ago
- Scripts for scraping metadata from Project Gutenberg books, via GITenberg.☆19Sep 11, 2018Updated 7 years ago
- Audio/Video in Hydra☆19May 26, 2017Updated 8 years ago
- A BERT-based application for reusable text classification at scale☆37Jul 23, 2023Updated 2 years ago
- A DH abstracts conversion tool☆13Apr 24, 2026Updated last month
- Code for SIGIR-2021 full paper: Initiative-Aware Self-Supervised Learning for Knowledge-Grounded Conversations☆11Aug 3, 2021Updated 4 years ago
- A tool for analyzing the word histories of a text.☆37Dec 8, 2025Updated 5 months ago
- Pipeline to generate the Standardized Project Gutenberg Corpus☆215Jan 5, 2024Updated 2 years ago
- A point-and-click tool for creating and analyzing topic models produced by MALLET.☆115Mar 1, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- (Mental) maps of texts with kernel density estimation and force-directed networks.☆108Jun 22, 2015Updated 10 years ago
- Histonets is an application to convert images of scanned maps into digital networks☆20Oct 16, 2017Updated 8 years ago
- A LibreOffice extension that converts JabRef references to plain text code and vice versa so that you can use your references with MS Off…☆13Aug 15, 2024Updated last year
- BookNLP, a natural language processing pipeline for books☆918Jul 31, 2024Updated last year
- ☆67Mar 4, 2024Updated 2 years ago
- A content-based recommender system for books using the Project Gutenberg text corpus☆29Feb 20, 2017Updated 9 years ago
- a python package for cleaning Gutenberg books and dataset☆35May 2, 2025Updated last year
- command line resource for working with digital primary sources☆29Aug 3, 2018Updated 7 years ago
- Contains materials for a work in progress - "A Humanist's Cookbook for Natural Language Processing in Python."☆41Nov 29, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- generate rules from lists of words☆16Jul 9, 2021Updated 4 years ago
- A digital humanities operating system that runs on a USB disk.☆32Jul 5, 2017Updated 8 years ago
- ☆35Jun 21, 2023Updated 2 years ago
- Collector and speech cutter for librivox audiobooks☆24Dec 8, 2022Updated 3 years ago
- Python scripts for interacting with Omeka API via YAML and CSV☆17Aug 3, 2014Updated 11 years ago
- The Multitask Long Document Benchmark☆42Nov 2, 2022Updated 3 years ago
- Notebooks and other course materials for Emory QTM 340 (Fall 2022)☆12Dec 13, 2022Updated 3 years ago
- Tools for working with HTRC Feature Extraction files☆44Jul 8, 2025Updated 10 months ago
- Digital Pedagogy in the Humanities: Concepts, Models, and Experiments☆130Oct 16, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Data and code for analyzing language associated with fictional characters.☆15Jan 6, 2018Updated 8 years ago
- A devise extension for remote user authentication☆15Dec 8, 2020Updated 5 years ago
- Flask Interface to Thompson's Motif Index☆19Jul 9, 2019Updated 6 years ago
- Dataset for NAACL 2021 paper: "QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization"☆150Aug 29, 2023Updated 2 years ago
- `Black` for Jupyter notebooks.☆19Apr 23, 2020Updated 6 years ago
- A collection of scripts for teaching and learning basic text mining methods in R☆10Sep 10, 2018Updated 7 years ago
- CAL-ACCESS Campaign Power Search☆13Nov 2, 2017Updated 8 years ago