Software for detecting text reuse with BLAST.
☆13Oct 8, 2019Updated 6 years ago
Alternatives and similar repositories for textreuse-blast
Users that are interested in textreuse-blast are comparing it to the libraries listed below.
- HFST spell checker library and command line tool☆14Feb 20, 2024Updated 2 years ago
- spaCy-compatible sm/md/lg/trf core models for Latin, i.e., a pipeline with POS tagger, morphologizer, lemmatizer, dependency parser, and NER☆12Aug 26, 2025Updated 7 months ago
- ☆15Aug 30, 2021Updated 4 years ago
- ☆11Dec 2, 2018Updated 7 years ago
- Named Entity Recognition tool for Europeana Newspapers☆14Apr 5, 2018Updated 8 years ago
- Detect and align similar passages☆120Mar 17, 2026Updated last month
- Pedalion trees☆12Jan 24, 2023Updated 3 years ago
- Datasets for training and evaluating Ancient Greek sentence embedding models☆17Jul 12, 2024Updated last year
- Finnish language analysis for Elasticsearch using Raudikko☆12Mar 4, 2026Updated last month
- 🇧🇪 BelGPT-2: the 1st GPT model pretrained in French.☆33Feb 24, 2021Updated 5 years ago
- ☆10Mar 2, 2026Updated last month
- Python tools for performing various operations on ALTO XML files☆49Feb 27, 2025Updated last year
- A French litbank corpus☆10Jan 22, 2026Updated 2 months ago
- Format conversion and graphical representation of [Universal Dependencies](http://universaldependencies.org) trees.☆12Sep 3, 2024Updated last year
- This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first steps in almost any Natural Language Processing task, yet…☆31Feb 2, 2026Updated 2 months ago
- Text Re-use Alignment Visualization☆38Nov 8, 2017Updated 8 years ago
- Named Entities Recognition Annotator Tool for Europeana Newspapers☆61Jan 12, 2018Updated 8 years ago
- Context Aware Language Models☆28Jul 3, 2018Updated 7 years ago
- XSLT for converting TEI MsDescription to IIIF manifests☆13Oct 18, 2016Updated 9 years ago
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)☆15Jan 20, 2026Updated 2 months ago
- The School of Salamanca. Web Application☆15Apr 9, 2026Updated last week
- Finds cats in photos☆18May 19, 2016Updated 9 years ago
- Extracts per-sentence subtitles + audio from a subtitle file + video file.☆12Oct 1, 2019Updated 6 years ago
- Data and code to support Distant Horizons (University of Chicago Press, 2019).☆11Feb 28, 2019Updated 7 years ago
- ☆12Oct 9, 2019Updated 6 years ago
- The base class from which to create a CWRC-Writer XML editor.☆14Apr 18, 2023Updated 2 years ago
- Get texts from the Perseus Digital Library☆21Jun 2, 2023Updated 2 years ago
- Cornell INFO 3350: Text mining for history and literature, Fall 2020☆10Jan 14, 2021Updated 5 years ago
- Machine Learning Books and References☆19Sep 3, 2019Updated 6 years ago
- A template for single-source academic publishing with Pandoc and Make.☆20Jul 21, 2022Updated 3 years ago
- Tropy plugin for exporting items into Omeka☆11Apr 20, 2023Updated 2 years ago
- Greek New Testament, edited by Eberhard Nestle, published in 1904 by the British and Foreign Bible Society. Transcription by Diego Santos…☆23May 11, 2023Updated 2 years ago
- Monitor Elasticsearch clusters with Grafana dashboards (via Elasticsearch)☆24Mar 10, 2022Updated 4 years ago
- Repository for the book Among Digitized Manuscripts by L.W. Cornelis van Lit (Leiden: Brill, 2020)☆25Feb 27, 2020Updated 6 years ago
- A framework for Oxygen XML Editor allowing researchers to transcribe historical documents in TEI☆21Jun 24, 2024Updated last year
- R package that helps to render interlinear glossed linguistic examples in html rmarkdown documents and then semi-automatically compiles t…☆17Nov 18, 2025Updated 4 months ago
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR☆15Dec 4, 2021Updated 4 years ago
- ☆14Oct 21, 2022Updated 3 years ago
- A general-purpose NLP pipeline for Ancient Greek☆28Mar 26, 2024Updated 2 years ago