NLP-Learning / TiLamb
基于LLaMA2-7B增量预训练的藏文大语言模型TiLamb(Tibetan Large Language Model Base)
☆23Updated last year
Alternatives and similar repositories for TiLamb:
Users that are interested in TiLamb are comparing it to the libraries listed below
- 基于LLAMA2的增量预训练藏文大语言 模型Tibetan-LLAMA2-7B&Tibetan-LLAMA2-13B;指令微调藏文大模型Tibetan-Alpaca-7B&Tibetan-Alpaca-13B。☆29Updated 10 months ago
- Code & data for our EMNLP2022 paper "SynGEC: Syntax-Enhanced Grammatical Error Correction with a Tailored GEC-Oriented Parser"☆82Updated last year
- LaTeX Thesis Template for Beijing Language and Culture University☆14Updated last week
- 本仓库是基于bert4keras实现的古文-现代文翻译模型。具体使用了基于掩码自注意力机制的UNILM(Li al., 2019)预训练模型作为翻译系统的backbone。我们首先使用了普通的中文(现代文)BERT、Roberta权重作为UNILM的初始权重以训练UNILM…☆49Updated 2 years ago
- Source code for the paper "Improving Chinese Spelling Check by Character Pronunciation Prediction: The Effects of Adaptivity and Granular…☆41Updated last year
- ☆267Updated 8 months ago
- ☆15Updated last year
- Yet Another Chinese Learner Corpus☆77Updated 3 years ago
- ☆74Updated 8 months ago
- This repository is intended for people who are interesting in learning/reading Classical Chinese (文言文) but be at a loss what to do to. As…☆13Updated 3 years ago
- code and data for "CSCD-NS: a Chinese Spelling Check Dataset for Native Speakers"☆68Updated 8 months ago
- 基于T5模型的中文文本纠错☆30Updated 5 months ago
- CINO: Pre-trained Language Models for Chinese Minority (少数民族语言预训练模型)☆242Updated 2 years ago
- 非官方的MDCSpell论文的实现☆18Updated 2 years ago
- text correction papers☆303Updated last year
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC中文语法纠错语料及STG模型☆114Updated 4 months ago
- ☆30Updated last year
- 中文soft-masked bert文本纠错复现☆21Updated 3 years ago
- CCL 2022 汉语学习者文本纠错评测☆138Updated 2 years ago
- 高性能GPU计算集群☆8Updated 5 months ago
- 历届中文句法错误诊断技术评测数据集☆40Updated 2 years ago
- Code and data of the paper "MCTS: A Multi-Reference Chinese Text Simplification Dataset".☆30Updated 10 months ago
- 中文机器阅读理解数据集☆103Updated 4 years ago
- SIGHAN中文纠错数据集及转换后格式☆64Updated 5 years ago
- ☆47Updated last year
- ChineseBert用于中文拼写纠错☆41Updated 2 years ago
- Yet Another Chinese Spelling Check Dataset (YACSC)☆19Updated last year
- OMGEval😮: An Open Multilingual Generative Evaluation Benchmark for Foundation Models☆33Updated 9 months ago
- ☆166Updated 3 years ago
- Paper list for grammatical error correction (GEC).☆41Updated 3 weeks ago