Latin text dataset for machine learning and procedural text generation
☆20Jun 3, 2024Updated 2 years ago
Alternatives and similar repositories for LatinTextDataset
Users that are interested in LatinTextDataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LexInfo - Data Category Ontology for OntoLex-Lemon☆28Sep 16, 2025Updated 8 months ago
- Neural coreference resolution☆12Sep 3, 2024Updated last year
- ParaNames: A multilingual resource for parallel names☆40May 20, 2024Updated 2 years ago
- a simple try to reproduce the paper: Super-Identity Convolutional Neural Network for Face Hallucination☆12May 4, 2019Updated 7 years ago
- Using Conditional Random Fields for segmenting Latin words written in scriptio continua☆10May 30, 2018Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Data labeling using few shot learning GPT-3.☆25Mar 26, 2023Updated 3 years ago
- Notes from Python's NLTK book☆15Jul 10, 2018Updated 7 years ago
- Matrix Methods In Data Analysis, Signal Processing, And Machine Learning☆10Sep 2, 2018Updated 7 years ago
- Grammar exercises generated from books & subtitles☆21Jan 9, 2024Updated 2 years ago
- several algorithms for converting dependency structures into constituency structures.☆10Feb 7, 2022Updated 4 years ago
- An offshoot of the Awesome-Public-Datasets repo I'm cultivating☆15Dec 3, 2019Updated 6 years ago
- A smart distributed crawler that infers navigation models of structured websites, used to cluster pages based on their structure and extr…☆10Aug 17, 2025Updated 9 months ago
- Command-line corpus tools☆12May 15, 2017Updated 9 years ago
- Language Tool style grammar handling with spaCy 2.0☆42Jul 27, 2018Updated 7 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- My public domain speech index☆13Sep 19, 2019Updated 6 years ago
- ☆14Dec 8, 2022Updated 3 years ago
- Citron is an experimental quote extraction system created by BBC R&D☆36Dec 14, 2021Updated 4 years ago
- Propuestas para Char.la☆12Apr 7, 2017Updated 9 years ago
- Unsupervised method for extracting quotation-speaker pairs from large news corpora.☆29Jul 4, 2018Updated 7 years ago
- A collection of computer tools for aiding the text critical workflow from transcription to collation to analysis.☆26Apr 6, 2025Updated last year
- PHP code that analyzes Latin and Greek words' parts of speech, tenses, genders, moods, etc.☆22Apr 21, 2026Updated last month
- Hunspell analysis for ElasticSearch☆38Jan 20, 2012Updated 14 years ago
- World Country Profiles Sourced from Wikipedia's Country Page Infoboxes Converted into JSON - Free Open Public Domain Data☆14Dec 10, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Script to import youtube-dl metadata to PostgreSQL☆14Aug 13, 2018Updated 7 years ago
- Reading the data from OPIEC - an Open Information Extraction corpus☆39Jun 12, 2019Updated 7 years ago
- Code for Context based Approach for Second Language Acquisition☆13Feb 10, 2023Updated 3 years ago
- JVM bytecode assembler as REST api☆11Jul 27, 2025Updated 10 months ago
- An alternative approach for probabilistic topic modeling based on agglomerative clustering of topics (not documents)☆12Apr 14, 2021Updated 5 years ago
- Make multiple tile layers transparent.☆21Jan 7, 2021Updated 5 years ago
- This repository includes all the code and data for the paper ELiDi (End2end Entity Linking and Disambiguation)☆14Jul 18, 2021Updated 4 years ago
- ☆35Feb 4, 2022Updated 4 years ago
- Agentics is a Python framework that transform LLM computation into functional code.☆81Jun 5, 2026Updated last week
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- MBTI text classifer trained on the Kaggle MBTI dataset☆34Oct 7, 2018Updated 7 years ago
- The core NLP library for automatic question generation☆17Mar 7, 2017Updated 9 years ago
- 👩🏫 Pre-trained German Language Model with sub-word tokenization for ULMFIT☆15Jun 24, 2020Updated 5 years ago
- CODO is an ontology for the semantic representation and annotation of COVID-19 data in a machine-readable form for tracking history of th…☆10Apr 19, 2022Updated 4 years ago
- A visualisation tool for Spacy using Hierplane.☆64Jan 25, 2023Updated 3 years ago
- A python implementation of portrait lighting transfer using a mass transport approach.☆37Jun 1, 2022Updated 4 years ago