Code to analyse books and newspapers data using Apache Spark.
☆16Feb 11, 2022Updated 4 years ago
Alternatives and similar repositories for defoe
Users that are interested in defoe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Awesome AI in Libraries☆17Jul 21, 2023Updated 2 years ago
- This O'Reilly course will introduce participants to the techniques and applications of text mining and sentiment analysis by training the…☆26May 14, 2021Updated 4 years ago
- 2018 Computational Text Analysis Notebooks, University of Mannheim☆13Nov 22, 2018Updated 7 years ago
- Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis☆13Aug 21, 2025Updated 7 months ago
- TEI-encoded contents of the Egyptian Gazette☆15Jun 11, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Projekt «Named Entity Recognition für die zentralen Serien des Staatsarchivs Kanton Zürich»☆10Jul 14, 2025Updated 9 months ago
- This is my 2024 course for TAP Institute on Vector Databases and Semantic Searching.☆12Jul 26, 2024Updated last year
- The GeoNewsMiner (GNM): An interactive spatial humanities tool to visualize geographical references in historical newspapers☆20Feb 21, 2022Updated 4 years ago
- R package for Google Document AI☆44Jan 19, 2026Updated 2 months ago
- Code and Data for TAD 3001-004 at NYU☆10Apr 25, 2017Updated 8 years ago
- ☆12Jun 21, 2021Updated 4 years ago
- Read/write 8-bit/16-bit PNGs with rasters, native rasters, numeric+integer arrays, indexed images with palette, packed pixels in raw vect…☆20Feb 28, 2026Updated last month
- ☆12Jan 31, 2015Updated 11 years ago
- Discovering IIIF manifests☆19May 16, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Computer Vision tutorial for DH Summer School Antwerp☆11Updated this week
- Course materials for "The Social Informatics of Large Language Models"☆11Apr 20, 2025Updated 11 months ago
- ☆13Mar 30, 2022Updated 4 years ago
- A desktop wrapper for Mirador and its environment, allowing use of local images.☆14Aug 24, 2018Updated 7 years ago
- ☆15Aug 19, 2024Updated last year
- Print Data Frames Like Tibbles☆14Nov 5, 2020Updated 5 years ago
- A companion repository to the "You Only Write Thrice: Creating Documents, Computational Notebooks and Presentations From a Single Source"…☆20Oct 14, 2022Updated 3 years ago
- Celluloid is an open source collaborative video annotation app for educational organizations☆24Apr 2, 2026Updated 2 weeks ago
- Tools for Ex-post Survey Data Harmonization☆11Dec 4, 2025Updated 4 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Codebase for "Decoding language spatial relations to 2D spatial arrangements" (Findings of EMNLP 2020).☆11Feb 10, 2023Updated 3 years ago
- Literary Language Toolkit: code, models, corpora, and web tools☆11Updated this week
- Manipulation testing using local polynomial density methods.☆13Jan 23, 2025Updated last year
- Course materials for a short course at UC Berkeley☆10Jun 22, 2022Updated 3 years ago
- Software for preprocessing textual data in multiple languages for textual analysis.☆23Feb 28, 2016Updated 10 years ago
- Repo of the Turing's Humanities & Data Science Discussion Group☆13Jul 21, 2022Updated 3 years ago
- Transcribed Area Description Data☆15Dec 5, 2023Updated 2 years ago
- ***deprecated*** A Web-based tool for validating & correcting geo-resolution results☆18Jan 16, 2021Updated 5 years ago
- This repository holds course materials for the fall 2021 offering of Statistics 243 at UC Berkeley.☆16Sep 28, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Simple text mining of journal articles from JSTOR's Data for Research service☆72Dec 26, 2016Updated 9 years ago
- The PELAGIOS Cookbook☆25Jan 16, 2021Updated 5 years ago
- natural language processing on german texts☆16Mar 20, 2018Updated 8 years ago
- Tutorials zu Schnittstellen und beispielhaften Datenanalysen☆29Feb 21, 2026Updated last month
- A tutorial & reproducible example on calculating residential segregation indices with decennial US census data (Version 1-0-0)☆14Jan 23, 2023Updated 3 years ago
- A python module for evaluating NERC and NEL system performances as defined in the HIPE shared tasks (formerly CLEF-HIPE-2020-scorer).☆15Jun 4, 2024Updated last year
- R Client for Dataverse Repositories☆65Mar 6, 2026Updated last month