This is a pre-trained LSTM model. This model can help you to segment unpunctuated historical Chinese texts. 這是基於 LSTM 的預訓練模型。此模型可幫助您為漢語古文斷句。
☆28Nov 19, 2021Updated 4 years ago
Alternatives and similar repositories for sentence-segmentation-for-chinese-historical-texts
Users that are interested in sentence-segmentation-for-chinese-historical-texts are comparing it to the libraries listed below
Sorting:
- This project aims to curate and provide a comprehensive collection of prompts designed specifically for generative AI models in the conte…☆36Oct 22, 2025Updated 4 months ago
- 古代汉语资源☆17Feb 25, 2023Updated 3 years ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆15Dec 30, 2025Updated 2 months ago
- Raw text of 申報☆27Jan 17, 2022Updated 4 years ago
- Tool for performing basic text analysis on the CBETA corpus☆33Sep 6, 2023Updated 2 years ago
- classic Chinese punctuate experiment with keras using daizhige(殆知阁古代文献藏书) dataset☆35Dec 8, 2022Updated 3 years ago
- A curated list of digital things related to the field of Chinese studies.☆34Sep 4, 2020Updated 5 years ago
- ☆12Aug 24, 2022Updated 3 years ago
- A semantic image annotation tool for researchers, digital humanists and cultural heritage professionals.☆57Updated this week
- Deliveres functionality to securely fetch and provide 3rd Party resources as well as proxying requests back to the 3rd Party Provider. Th…☆10Mar 19, 2025Updated 11 months ago
- ☆37Oct 18, 2024Updated last year
- ☆10Sep 27, 2021Updated 4 years ago
- XML Schema pattern (regular expression) engine☆11Sep 26, 2024Updated last year
- Unofficial PyTorch Implementation of OpenAI's GPT-3☆13Apr 11, 2022Updated 3 years ago
- Research Environment for Ancient Documents☆44Jan 24, 2026Updated last month
- 基于BERT+Biaffine结构的关系抽取模型☆12Feb 23, 2022Updated 4 years ago
- ☆11Feb 13, 2020Updated 6 years ago
- Yet another pdf-mode for Emacs☆11Jun 7, 2021Updated 4 years ago
- ☆13Aug 29, 2022Updated 3 years ago
- ☆10Feb 2, 2026Updated last month
- Succeeded by syntaxdot-transformers: https://github.com/tensordot/syntaxdot/tree/main/syntaxdot-transformers☆19Oct 7, 2020Updated 5 years ago
- A generic Either type implementation for Rust☆14Jan 4, 2023Updated 3 years ago
- Evaluation of Natural Language Processing (NLP) tools for the Ancient Chinese language☆44Feb 26, 2026Updated last week
- linguistic converter / merging tool for multi-level annotated corpora. graph-based (using Python and NetworkX).☆51Nov 14, 2025Updated 3 months ago
- A pre-trained model with multi-exit transformer architecture.☆56Dec 10, 2022Updated 3 years ago
- Swift scripts for PDF manipulation, for Shortcuts or Terminal☆15Dec 25, 2022Updated 3 years ago
- A blazingly fast tool for converting to English punctuations☆10Sep 18, 2022Updated 3 years ago
- Neural Processing Letters: End-to-End Entity Detection with Proposer and Regressor☆12Jun 6, 2023Updated 2 years ago
- Python-based Scraping and parsing toolkit☆12Apr 1, 2023Updated 2 years ago
- XML:DB Initiative for XML Databases☆17Updated this week
- Open source XML MVC framework for eXist-db☆14May 9, 2023Updated 2 years ago
- The official GitHub repository for AC-EVAL, an ancient Chinese evaluation suite for large language models (LLMs)☆16Nov 12, 2024Updated last year
- ☆10Feb 3, 2021Updated 5 years ago
- A Proof of Concept for running V2Ray on iOS, will be superseded quickly☆13Jul 1, 2021Updated 4 years ago
- Make your IDs strongly typed!!☆13Sep 22, 2022Updated 3 years ago
- eXist-db library module to interact with GitHub via the GitHub API v3☆12Aug 24, 2021Updated 4 years ago
- Codes and Datasets for our ECIR 2021 Paper: "Reproducibility, Replicability and Beyond: Assessing Production Readiness of Aspect Based Se…☆10Jan 21, 2021Updated 5 years ago
- Comparing Polars vs Pandas vs Rust native :)☆13Aug 25, 2021Updated 4 years ago
- Backup files on Emacs through git☆10Nov 11, 2020Updated 5 years ago