Latin text dataset for machine learning and procedural text generation
☆20Jun 3, 2024Updated last year
Alternatives and similar repositories for LatinTextDataset
Users that are interested in LatinTextDataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Collected files from thelatinlibrary.com☆24Apr 16, 2019Updated 7 years ago
- A french litbank corpus☆10Jan 22, 2026Updated 2 months ago
- LexInfo - Data Category Ontology for OntoLex-Lemon☆26Sep 16, 2025Updated 7 months ago
- Neural coreference resolution☆10Sep 3, 2024Updated last year
- a simple try to reproduce the paper: Super-Identity Convolutional Neural Network for Face Hallucination☆12May 4, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Collected Latin files from the Perseus Digital Library☆14Jun 21, 2017Updated 8 years ago
- Using Conditional Random Fields for segmenting Latin words written in scriptio continua☆10May 30, 2018Updated 7 years ago
- Colab Notebooks for using Instant-NGP with View Control☆11Oct 7, 2023Updated 2 years ago
- Data labeling using few shot learning GPT-3.☆25Mar 26, 2023Updated 3 years ago
- An Intelligent Tutoring System (ITS) using various algorithms from literature.☆14Jan 7, 2023Updated 3 years ago
- Matrix Methods In Data Analysis, Signal Processing, And Machine Learning☆10Sep 2, 2018Updated 7 years ago
- Backend scripts, files, etc. for parsing/updating dictionaries.☆18Jun 22, 2015Updated 10 years ago
- A bunch of modules that use/extend CLTK in order to work with Greek and Latin corpora maintained by the Perseus DL☆12Oct 26, 2019Updated 6 years ago
- A trio of Google-Colab notebooks (ipynb) for training a GPT-2 (127M) model from scratch (useful for other / non-English languages) using …☆17Jun 29, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- cross lingual text classification on amazon reviews☆10Nov 4, 2019Updated 6 years ago
- A smart distributed crawler that infers navigation models of structured websites, used to cluster pages based on their structure and extr…☆10Aug 17, 2025Updated 7 months ago
- Command-line corpus tools☆12May 15, 2017Updated 8 years ago
- Bash flauvored lodash port☆11May 1, 2016Updated 9 years ago
- Language Tool style grammar handling with spaCy 2.0☆42Jul 27, 2018Updated 7 years ago
- ☆14Dec 8, 2022Updated 3 years ago
- Citron is an experimental quote extraction system created by BBC R&D☆35Dec 14, 2021Updated 4 years ago
- Propuestas para Char.la☆12Apr 7, 2017Updated 9 years ago
- Unsupervised method for extracting quotation-speaker pairs from large news corpora.☆29Jul 4, 2018Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Language lexicons for elasticsearch https://github.com/vhyza/elasticsearch-analysis-lemmagen plugin☆15Dec 11, 2018Updated 7 years ago
- Hunspell analysis for ElasticSearch☆38Jan 20, 2012Updated 14 years ago
- Example Facebook application powered by Fandjango☆19Jan 8, 2012Updated 14 years ago
- No longer maintained. Instead, try https://github.com/LucianU/bud, which uses Vagrant + Ansible to automate a lot more things.☆27Aug 4, 2015Updated 10 years ago
- World Country Profiles Sourced from Wikipedia's Country Page Infoboxes Converted into JSON - Free Open Public Domain Data☆14Dec 10, 2020Updated 5 years ago
- 🏛️ An open-source tool for learning Latin☆25Jan 21, 2018Updated 8 years ago
- Things and stuff for times, dates and datetimes. Maybe they're useful☆14Aug 1, 2018Updated 7 years ago
- Script to import youtube-dl metadata to PostgreSQL☆14Aug 13, 2018Updated 7 years ago
- A monolithic index that supports worst-case optimal joins (WCOJ) by providing all collation orders in a single redundancy eliminating dat…☆16Sep 18, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- In-browser OCR of Ancient Greek and Latin☆27Apr 8, 2026Updated last week
- An alternative approach for probabilistic topic modeling based on agglomerative clustering of topics (not documents)☆12Apr 14, 2021Updated 5 years ago
- JVM bytecode assembler as REST api☆11Jul 27, 2025Updated 8 months ago
- This repository includes all the code and data for the paper ELiDi (End2end Entity Linking and Disambiguation)☆14Jul 18, 2021Updated 4 years ago
- Agentics is a Python framework that transform LLM computation into functional code.☆72Apr 9, 2026Updated last week
- The core NLP library for automatic question generation☆17Mar 7, 2017Updated 9 years ago
- CODO is an ontology for the semantic representation and annotation of COVID-19 data in a machine-readable form for tracking history of th…☆10Apr 19, 2022Updated 3 years ago