cdli-gh / data
This is a copy of the daily dump of catalogue and ATF data from the Cuneiform Digital Library Initiative (http://cdli.ucla.edu)
☆57 Updated 2 years ago
Alternatives and similar repositories for data
Users who are interested in data are comparing it to the repositories listed below
- Official releases of the PROIEL treebank of ancient Indo-European languages ☆39 Updated 2 years ago
- Libraries, Archives and Museums (LAM) ☆88 Updated 3 years ago
- A tool for analyzing the word histories of a text. ☆37 Updated last month
- Latin BERT ☆70 Updated last year
- Python tools for working with ORACC ☆13 Updated 6 years ago
- NYPL Project to transcribe and parse pages from the US Catalog of Copyright Entries ☆57 Updated 3 years ago
- The core repository for the Literary Theme Ontology Project. ☆26 Updated this week
- Public repository for Coptic SCRIPTORIUM Corpora Releases ☆40 Updated last month
- All the sources and documentation for Oracc ☆15 Updated 2 weeks ago
- Morphological analyzer and lemmatizer for Latin. ☆27 Updated last month
- Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation schemata for more than 100 languages. ☆22 Updated last month
- Tab-delimited versions of Catalog of Copyright Entries renewals ☆29 Updated 6 years ago
- Oracc GUI ☆12 Updated 7 months ago
- Perseus Treebank Data ☆76 Updated last year
- Building and Using A Seed Corpus for the Human Language Project ☆11 Updated 7 years ago
- I.PHI dataset generation ☆26 Updated 2 years ago
- The curation repository for the data behind Concepticon. ☆42 Updated this week
- Multi Tier Annotation Search ☆12 Updated last year
- Lexica and lemmata for the Ancient Greek language, from various sources ☆20 Updated 5 years ago
- 🗣 Multilingual RDF Verbalizer – Google Summer of Code 2019 ☆21 Updated 2 years ago
- In-browser OCR of Ancient Greek and Latin ☆26 Updated last month
- Machine Learning for Ancient Languages ☆31 Updated last year
- XML files for the works in the First Thousand Years of Greek Project. Please see our Wiki on how to contribute. ☆107 Updated last week
- TEI Reader Python Library ☆19 Updated 7 months ago
- Hexatomic is an extensible software for deep multi-layer annotation of linguistic corpora ☆18 Updated last year
- The Global WordNet Association Collaborative Inter-Lingual Index ☆50 Updated last year
- Tutorials for the CLTK ☆53 Updated 5 years ago
- A web framework to display Cross Linguistic Linked Data. ☆63 Updated 5 months ago
- OCR post correction for old German corpus ☆19 Updated 3 years ago
- Data and Code for "The Values Encoded in Machine Learning Research" ☆45 Updated 3 years ago