A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books for computational text analysis.
☆119 · Sep 8, 2018 · Updated 7 years ago
Alternatives and similar repositories for chapterize
Users that are interested in chapterize are comparing it to the libraries listed below.
- A command-line program to download text corpora. ☆34 · Aug 12, 2017 · Updated 8 years ago
- Plots various graphs for a series of plaintext files in a directory ☆19 · Jun 6, 2016 · Updated 10 years ago
- Project to digitize avant-garde periodicals ☆12 · May 13, 2022 · Updated 4 years ago
- Early Novels Database dataset ☆16 · Jan 15, 2019 · Updated 7 years ago
- A few scripts written during a system migration that use PyMARC ☆10 · Jan 30, 2020 · Updated 6 years ago
- Practical Approaches to Data Science with Text ☆40 · Dec 6, 2019 · Updated 6 years ago
- A simple interface to the Project Gutenberg corpus. ☆333 · Jan 12, 2023 · Updated 3 years ago
- ☆198 · Updated this week
- Audio/Video in Hydra ☆19 · May 26, 2017 · Updated 9 years ago
- A BERT-based application for reusable text classification at scale ☆37 · Jul 23, 2023 · Updated 2 years ago
- Code for SIGIR-2021 full paper: Initiative-Aware Self-Supervised Learning for Knowledge-Grounded Conversations ☆11 · Aug 3, 2021 · Updated 4 years ago
- Scripts that clean up OCR and munge Hathi metadata. ☆78 · Nov 4, 2017 · Updated 8 years ago
- BookReconciler, A Tool for Metadata Enrichment and Clustering of Book Data ☆40 · Mar 2, 2026 · Updated 3 months ago
- Notebook for looking at 35 years of historical US degrees data from NCES-IPEDS ☆11 · Dec 18, 2018 · Updated 7 years ago
- A tokenizer for French ☆14 · Apr 18, 2013 · Updated 13 years ago
- BIBFRAME extension ontologies for modeling bibliographic metadata in the art and rare materials domains. ☆17 · Feb 12, 2021 · Updated 5 years ago
- Histonets is an application to convert images of scanned maps into digital networks ☆20 · Oct 16, 2017 · Updated 8 years ago
- Fetch and parse the American Presidency Project's press-briefing and presidential-news-conference transcripts. ☆11 · Aug 18, 2016 · Updated 9 years ago
- ☆67 · Mar 4, 2024 · Updated 2 years ago
- Alternative implementation of the coreference scorer for the CoNLL-2011/2012 shared tasks on coreference resolution ☆11 · Apr 29, 2021 · Updated 5 years ago
- Contains materials for a work in progress - "A Humanist's Cookbook for Natural Language Processing in Python." ☆41 · Nov 29, 2021 · Updated 4 years ago
- A digital humanities operating system that runs on a USB disk. ☆32 · Jul 5, 2017 · Updated 8 years ago
- Archive of the XML files of the Mannheim / Heidelberg CAMENA Neo-Latin project ☆20 · Oct 10, 2018 · Updated 7 years ago
- Collector and speech cutter for librivox audiobooks ☆24 · Dec 8, 2022 · Updated 3 years ago
- Code to reproduce results in "Finding Streams in Knowledge Graphs to Support Fact Checking" ☆35 · Apr 30, 2025 · Updated last year
- Python scripts for interacting with Omeka API via YAML and CSV ☆17 · Aug 3, 2014 · Updated 11 years ago
- The Multitask Long Document Benchmark ☆42 · Nov 2, 2022 · Updated 3 years ago
- A standalone React/Redux web application for presenting unique printed books and manuscripts in digital facsimile. ☆31 · Mar 10, 2023 · Updated 3 years ago
- Dat python client ☆46 · Sep 1, 2016 · Updated 9 years ago
- *dramavis* is a Python program dedicated to the network analysis of dramatic texts. It computes a variety of network measures as well as … ☆11 · Jan 17, 2018 · Updated 8 years ago
- Digital Pedagogy in the Humanities: Concepts, Models, and Experiments ☆130 · Oct 16, 2021 · Updated 4 years ago
- The Open Scholarly Edition of James Joyce's A Portrait of the Artist as a Young Man ☆21 · May 10, 2019 · Updated 7 years ago
- A push-button Digital Humanities laboratory. ☆127 · Jun 7, 2018 · Updated 8 years ago
- A Keras-based and TensorFlow-backend NLP Models Toolkit. ☆12 · Jul 7, 2022 · Updated 3 years ago
- Data and code for analyzing language associated with fictional characters. ☆15 · Jan 6, 2018 · Updated 8 years ago
- Wrapper for DKPro Core to extract linguistic information from books. ☆16 · Feb 26, 2022 · Updated 4 years ago
- Natural Language Inflection in English ☆11 · Jan 10, 2022 · Updated 4 years ago
- A Devise extension for remote user authentication ☆15 · Dec 8, 2020 · Updated 5 years ago
- ☆32 · Mar 14, 2017 · Updated 9 years ago