cdli-gh / dataLinks
This is a copy of the daily dump of catalogue and ATF data from the Cuneiform Digital Library Initiative (http://cdli.ucla.edu)
☆57Updated 2 years ago
Alternatives and similar repositories for data
Users that are interested in data are comparing it to the libraries listed below
Sorting:
- ☆31Updated 8 years ago
- Python tools for working with ORACC☆13Updated 6 years ago
- 'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.☆22Updated 7 years ago
- Official releases of the PROIEL treebank of ancient Indo-European languages☆39Updated 2 years ago
- I.PHI dataset generation☆26Updated 2 years ago
- Libraries, Archives and Museums (LAM)☆88Updated 3 years ago
- A tool for analyzing the word histories of a text.☆37Updated 2 months ago
- Preliminary spaCy models for Latin☆14Updated 3 years ago
- Tab-delimited versions of Catalog of Copyright Entries renewals☆29Updated 6 years ago
- Latin BERT☆70Updated last year
- A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Sp…☆32Updated 7 months ago
- A simple interface to the Project Gutenberg corpus.☆331Updated 3 years ago
- Text Re-use Alignment Visualization☆38Updated 8 years ago
- Restoring ancient text using deep learning: a case study on Greek epigraphy.☆230Updated 2 years ago
- The NLG tool for Finnish☆24Updated 2 years ago
- A corpus of poetry from Project Gutenberg☆212Updated 7 years ago
- This repository contains the Framester resource, the main outcome of the framester project.☆33Updated 3 months ago
- The Global WordNet Association Collaborative Inter-Lingual Index☆50Updated last year
- Perseus Treebank Data☆76Updated last year
- Scripts for scraping metadata from Project Gutenberg books, via GITenberg.☆19Updated 7 years ago
- Python 3 library for processing historical English☆68Updated last year
- Computational Assyriology☆19Updated 3 months ago
- Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.☆35Updated 2 years ago
- The CMU Link Grammar natural language parser☆407Updated 3 months ago
- I wanted all of plaintext Project Gutenberg in an easy-to-use format, so I made this☆225Updated 2 years ago
- Automatically exported from code.google.com/p/foma☆128Updated 4 months ago
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span…☆90Updated 3 months ago
- Open morphology for Finnish☆95Updated 2 weeks ago
- All the sources and documentation for Oracc☆15Updated 2 weeks ago
- NYPL Project to transcribe and parse pages from the US Catalog of Copyright Entries☆57Updated 3 years ago