cdli-gh / dataLinks
This is a copy of the daily dump of catalogue and ATF data from the Cuneiform Digital Library Initiative (http://cdli.ucla.edu)
☆57Updated 2 years ago
Alternatives and similar repositories for data
Users that are interested in data are comparing it to the libraries listed below
Sorting:
- Official releases of the PROIEL treebank of ancient Indo-European languages☆38Updated 2 years ago
- Python tools for working with ORACC☆13Updated 6 years ago
- ☆28Updated last year
- Libraries, Archives and Museums (LAM)☆87Updated 3 years ago
- Morphological analyzer and lemmatizer for Latin.☆27Updated 8 months ago
- Latin BERT☆66Updated last year
- Machine Learning for Ancient Languages☆30Updated last year
- All the sources and documentation for Oracc☆14Updated this week
- XML files for the works in the First Thousand Years of Greek Project. Please see our Wiki on how to contribute.☆102Updated last month
- The curation repository for the data behind Concepticon.☆40Updated 3 weeks ago
- I.PHI dataset generation☆26Updated last year
- Perseus Treebank Data☆74Updated last year
- Restoring ancient text using deep learning: a case study on Greek epigraphy.☆230Updated last year
- Text Re-use Alignment Visualization☆38Updated 7 years ago
- A multilingual parallel corpus created from translations of the Bible.☆189Updated 5 months ago
- A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Sp…☆32Updated 3 months ago
- The Global WordNet Association Collaborative Inter-Lingual Index☆47Updated 11 months ago
- A corpus of poetry from Project Gutenberg☆209Updated 7 years ago
- In-browser OCR of Ancient Greek and Latin☆26Updated last month
- Public repository for Coptic SCRIPTORIUM Corpora Releases☆37Updated last month
- A simple interface to the Project Gutenberg corpus.☆330Updated 2 years ago
- Python 3 library for processing historical English☆67Updated last year
- This is a collection of sentence-level aligned Sanskrit-Tibetan Etexts.☆15Updated 3 years ago
- TEI Reader Python Library☆18Updated 3 months ago
- Automatically exported from code.google.com/p/colore☆72Updated 10 months ago
- 👩🔬 A web-based, open-access platform for linguistic research on old indic texts☆22Updated 4 months ago
- Latin text dataset for machine learning and procedural text generation☆18Updated last year
- 'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.☆22Updated 7 years ago
- the EEBO TCP texts☆34Updated 7 years ago
- A tool for analyzing the word histories of a text.☆35Updated 10 months ago