DEFI-COLaF/LADaS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/DEFI-COLaF/LADaS)

DEFI-COLaF / LADaS

Layout Analysis Dataset with Segmonto (LADaS)

☆25

Alternatives and similar repositories for LADaS

Users that are interested in LADaS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

PonteIneptique / YALTAi
View on GitHub
You Actually Look Twice At it
☆42Apr 15, 2026Updated 3 months ago
hirmeos / entity-fishing-client-python
View on GitHub
Repository hosting the common code for the entity-fishing clients
☆10May 18, 2026Updated 2 months ago
PonteIneptique / choco-mufin
View on GitHub
Tools for normalizing the use of some characters and checking file consistencies
☆12May 30, 2026Updated last month
SupervisedStylometry / SuperStyl
View on GitHub
Supervised Stylometry
☆27Mar 4, 2026Updated 4 months ago
UniversalDependencies / UD_German-HDT
View on GitHub
☆14May 29, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
WHaverals / CERberus
View on GitHub
CERberus -- guardian against character errors
☆30Jul 3, 2026Updated 3 weeks ago
kermitt2 / biblio-glutton-extension
View on GitHub
A browser extension providing Open Access bibliographical services
☆18Dec 9, 2022Updated 3 years ago
beratkurar / textline_segmentation_using_fcn
View on GitHub
☆17Sep 25, 2021Updated 4 years ago
vanda / curtain-viewer
View on GitHub
☆12Sep 2, 2024Updated last year
bethelmelesse / UnifiedCrawl
View on GitHub
☆17Nov 26, 2024Updated last year
ArchivesNationalesFR / Referentiels
View on GitHub
Les référentiels des Archives nationales de France | the Archives nationales de France authority data and vocabularies
☆15Jun 16, 2026Updated last month
distributed-text-services / specifications
View on GitHub
Specifications for the DTS API
☆33May 18, 2026Updated 2 months ago
Pleias / Various-Finetuning
View on GitHub
Set of scripts to finetune LLMs
☆38Mar 30, 2024Updated 2 years ago
ahpnils / cours-linux-shell
View on GitHub
Cours d'initiation à la ligne de commande sous Linux
☆42Oct 27, 2025Updated 9 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
opinionscience / BERTransfer
View on GitHub
A BERT-based application for reusable text classification at scale
☆37Jul 23, 2023Updated 3 years ago
mlabonne / tinytuner
View on GitHub
🐜🔧 A minimalistic tool to fine-tune your LLMs
☆19Aug 17, 2023Updated 2 years ago
SebastianBodza / EnsembleForecasting
View on GitHub
Using multiple LLMs for ensemble Forecasting
☆16Jan 17, 2024Updated 2 years ago
dmahan93 / lm-evaluation-harness
View on GitHub
A framework for few-shot evaluation of autoregressive language models.
☆16Aug 23, 2023Updated 2 years ago
ComplexNetTSP / MultilayerParis
View on GitHub
Paris multilayer transport network
☆11Sep 13, 2021Updated 4 years ago
pchizhov / picky_bpe
View on GitHub
BPE modification that implements removing of the intermediate tokens during tokenizer training.
☆27Nov 25, 2024Updated last year
ja-mcm / OCRfixr
View on GitHub
A context-based spellchecker for correcting OCR output.
☆21Feb 3, 2023Updated 3 years ago
fblgit / model-similarity
View on GitHub
Simple Model Similarities Analysis
☆21Feb 3, 2024Updated 2 years ago
PonteIneptique / cours-python
View on GitHub
Cours de python enseigné à l'École nationale des Chartes
☆37Jul 6, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
cwrc / ontology
View on GitHub
CWRC ontology - primary repository
☆13Jul 8, 2026Updated 2 weeks ago
ahpnils / cours-server-linux
View on GitHub
Cours de gestion de serveur Linux
☆47Mar 15, 2026Updated 4 months ago
alexbrandsen / jsonl2bio
View on GitHub
Script that converts JSONL output from Doccano to the BIO format
☆10Jul 5, 2019Updated 7 years ago
qurator-spk / sbb_ner
View on GitHub
Named Entity Recognition
☆19Feb 13, 2026Updated 5 months ago
ecomp-shONgit / string-distance
View on GitHub
A set of (string) distance functions written in JavaScript / Python / PHP.
☆18Feb 2, 2026Updated 5 months ago
NewsEye / NLP-Notebooks-Newspaper-Collections
View on GitHub
A collection of notebooks for Natural Language Processing
☆25Jan 13, 2025Updated last year
Lucaterre / spacyfishing
View on GitHub
A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata
☆173Nov 7, 2022Updated 3 years ago
edupoux / MVA_2023_SL
View on GitHub
Course materials for the MVA course "algorithms for speech and language processing"
☆13Mar 29, 2023Updated 3 years ago
ian-nai / PyGallica
View on GitHub
A Python wrapper for the National Library of France's Gallica API.
☆22Apr 10, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
natliblux / nautilusocr
View on GitHub
METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)
☆56May 30, 2023Updated 3 years ago
scantailor / ScanTailor-CLI-GUI
View on GitHub
Batch processing helper – GUI – for “ScanTailor-CLI” -- created by Csaba Kovacs
☆16Oct 2, 2016Updated 9 years ago
CoderPat / croissant-llm-training
View on GitHub
Repository containing the code for training the CroissantLLM
☆21Feb 4, 2024Updated 2 years ago
cloneofsimo / fim-llama-deepspeed
View on GitHub
☆33Jan 1, 2024Updated 2 years ago
thiippal / MoodCat
View on GitHub
MoodCat😼 classifies the mood of English sentences.
☆14Jun 19, 2022Updated 4 years ago
colibrisson / CHAT_models
View on GitHub
Automatic transcription models for Chinese historical documents trained with the kraken OCR engine
☆21Sep 27, 2023Updated 2 years ago
lfoppiano / material-parsers
View on GitHub
Material parsers and other tools, scripts Initially developed for Grobid Superconductor
☆14Feb 21, 2025Updated last year