Automatic transcription models for Chinese historical documents trained with the kraken OCR engine
☆20Sep 27, 2023Updated 2 years ago
Alternatives and similar repositories for CHAT_models
Users that are interested in CHAT_models are comparing it to the libraries listed below
Sorting:
- Chinese character variant converter. 中文异体字转换器。☆22Oct 17, 2025Updated 5 months ago
- uncover old chinese textual parallels based on sound☆15Feb 23, 2026Updated 3 weeks ago
- A collaborative research platform for resources on natural language texts☆14Mar 13, 2026Updated last week
- GND (Gemeinsame Normdatei) Ontologie☆14Dec 18, 2025Updated 3 months ago
- Extraction de LExique par Variation d'Entropie - Lexicon extraction based on the variation of entropy☆14Oct 25, 2020Updated 5 years ago
- ☆29Nov 12, 2025Updated 4 months ago
- Layout Analysis Dataset with Segmonto (LADaS)☆24Jul 12, 2025Updated 8 months ago
- Data and some scripts for historical social network analysis in Chinese Buddhism☆19Oct 13, 2025Updated 5 months ago
- Groq-powered MAD: The first work to explore Multi-Agent Debate with Large Language Models :D☆12Jul 5, 2024Updated last year
- Complete Python Web Developer Course: First program (age in seconds)☆12Apr 18, 2020Updated 5 years ago
- This project aims to curate and provide a comprehensive collection of prompts designed specifically for generative AI models in the conte…☆36Oct 22, 2025Updated 4 months ago
- Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis☆13Aug 21, 2025Updated 7 months ago
- Research Environment for Ancient Documents☆44Jan 24, 2026Updated last month
- Recognition Models for Kraken and CLSTM☆16Aug 21, 2019Updated 6 years ago
- Topic-Specific Diagnostics for LDA and CTM Topic Models☆25Jul 17, 2022Updated 3 years ago
- ☆13Jan 12, 2026Updated 2 months ago
- An evaluation bentchmark for classical Chinese☆19Dec 13, 2023Updated 2 years ago
- Visualize market depth☆17Mar 31, 2025Updated 11 months ago
- NDL古典籍OCRのアプリケーション(ソースコードを含む)☆93Oct 14, 2025Updated 5 months ago
- Chat with Time-Series Data in PostgreSQL using LlamaIndex and Timescale Vector☆12Mar 24, 2024Updated last year
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Oct 31, 2025Updated 4 months ago
- local dictionary files☆24Oct 9, 2016Updated 9 years ago
- FoxGIS Frontend☆12Jul 7, 2017Updated 8 years ago
- This script converts arxiv papers into a certain markdown format.☆18May 19, 2023Updated 2 years ago
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago
- Java command line tool to convert PAGE XML files with layout and text content to PDF☆10Apr 27, 2020Updated 5 years ago
- Historical Diagram Vectorization☆19Nov 25, 2025Updated 3 months ago
- 《汉语大字典》字头检索表☆20Nov 29, 2022Updated 3 years ago
- ☆21Apr 24, 2025Updated 10 months ago
- NLP-helper for OCR-ed pages in PAGE XML format☆10Dec 6, 2024Updated last year
- Kitodo.Publication☆14Nov 18, 2025Updated 4 months ago
- Automatic development for retrieval augmented generation system☆10Feb 2, 2025Updated last year
- Collection of OCR-related python tools and wrappers from @OCR-D☆133Feb 4, 2026Updated last month
- Chinese Notes: A digital library for classical and historic Chinese texts with built in dictionary and reader☆27Mar 5, 2026Updated 2 weeks ago
- Suite of generic Linked Data/SPARQL as well as LinkedDataHub-specific MCP tools☆38Feb 23, 2026Updated 3 weeks ago
- Named Entity Recognition tool for Europeana Newspapers☆14Apr 5, 2018Updated 7 years ago
- Project on the history of genre.☆24Feb 13, 2020Updated 6 years ago
- JournalTouch provides a touch-optimized interface for browsing current journal tables of contents in Responsive Design. Fun!☆14May 27, 2019Updated 6 years ago
- OCR-D post-correction with encoder-attention-decoder LSTMs☆13May 1, 2025Updated 10 months ago