Latin text dataset for machine learning and procedural text generation
☆20Jun 3, 2024Updated last year
Alternatives and similar repositories for LatinTextDataset
Users that are interested in LatinTextDataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data validation for Python, inspired by the Laravel framework.☆10Sep 29, 2023Updated 2 years ago
- A french litbank corpus☆10Jan 22, 2026Updated 2 months ago
- LexInfo - Data Category Ontology for OntoLex-Lemon☆26Sep 16, 2025Updated 6 months ago
- Neural coreference resolution☆10Sep 3, 2024Updated last year
- ParaNames: A multilingual resource for parallel names☆40May 20, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Plug-n-play reinforcement learning with OpenAI Gym and Keras☆16Sep 30, 2020Updated 5 years ago
- Data labeling using few shot learning GPT-3.☆25Mar 26, 2023Updated 3 years ago
- Matrix Methods In Data Analysis, Signal Processing, And Machine Learning☆10Sep 2, 2018Updated 7 years ago
- several algorithms for converting dependency structures into constituency structures.☆10Feb 7, 2022Updated 4 years ago
- An offshoot of the Awesome-Public-Datasets repo I'm cultivating☆15Dec 3, 2019Updated 6 years ago
- Retrieve and extract citations from Crossref data☆29Mar 11, 2021Updated 5 years ago
- cross lingual text classification on amazon reviews☆10Nov 4, 2019Updated 6 years ago
- A smart distributed crawler that infers navigation models of structured websites, used to cluster pages based on their structure and extr…☆10Aug 17, 2025Updated 7 months ago
- Alphabot: a screen-less interactive spelling primer powered by computer vision☆14Sep 11, 2018Updated 7 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- The linked open dataset described at http://datahub.io/dataset/vu-wordnet, and the tools used to create it☆26Oct 19, 2020Updated 5 years ago
- Bash flauvored lodash port☆11May 1, 2016Updated 9 years ago
- Language Tool style grammar handling with spaCy 2.0☆42Jul 27, 2018Updated 7 years ago
- My public domain speech index☆13Sep 19, 2019Updated 6 years ago
- ☆14Dec 8, 2022Updated 3 years ago
- Citron is an experimental quote extraction system created by BBC R&D☆36Dec 14, 2021Updated 4 years ago
- Propuestas para Char.la☆12Apr 7, 2017Updated 8 years ago
- A vim plugin for running perl, python, ruby, bash, etc. scripts inside of vim.☆18Nov 18, 2020Updated 5 years ago
- Unsupervised method for extracting quotation-speaker pairs from large news corpora.☆29Jul 4, 2018Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A collection of computer tools for aiding the text critical workflow from transcription to collation to analysis.☆26Apr 6, 2025Updated 11 months ago
- Language lexicons for elasticsearch https://github.com/vhyza/elasticsearch-analysis-lemmagen plugin☆15Dec 11, 2018Updated 7 years ago
- Hunspell analysis for ElasticSearch☆38Jan 20, 2012Updated 14 years ago
- Example Facebook application powered by Fandjango☆19Jan 8, 2012Updated 14 years ago
- No longer maintained. Instead, try https://github.com/LucianU/bud, which uses Vagrant + Ansible to automate a lot more things.☆27Aug 4, 2015Updated 10 years ago
- Agentics is a Python framework that provides structured, scalable, and semantically grounded agentic computation.☆67Mar 19, 2026Updated last week
- Reading the data from OPIEC - an Open Information Extraction corpus☆38Jun 12, 2019Updated 6 years ago
- A monolithic index that supports worst-case optimal joins (WCOJ) by providing all collation orders in a single redundancy eliminating dat…☆16Sep 18, 2025Updated 6 months ago
- JVM bytecode assembler as REST api☆11Jul 27, 2025Updated 7 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- An alternative approach for probabilistic topic modeling based on agglomerative clustering of topics (not documents)☆12Apr 14, 2021Updated 4 years ago
- Make multiple tile layers transparent.☆21Jan 7, 2021Updated 5 years ago
- This repository includes all the code and data for the paper ELiDi (End2end Entity Linking and Disambiguation)☆14Jul 18, 2021Updated 4 years ago
- ☆34Feb 4, 2022Updated 4 years ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- command-line tool to extract taxonomies from Wikidata☆130Jun 19, 2019Updated 6 years ago
- The core NLP library for automatic question generation☆17Mar 7, 2017Updated 9 years ago