ankiteciitkgp / bertTokenizerLinks
A java implementation of Bert Tokenizer.
☆29Updated 4 years ago
Alternatives and similar repositories for bertTokenizer
Users that are interested in bertTokenizer are comparing it to the libraries listed below
Sorting:
- 一个基于预训练的句向量生成工具☆138Updated 2 years ago
- LORA微调BLOOMZ,参考BELLE☆25Updated 2 years ago
- NLU & NLG (zero-shot) depend on mengzi-t5-base-mt pretrained model☆76Updated 3 years ago
- This is a java version of Chinese tokenization descried in BERT.☆59Updated 3 years ago
- 高性能文本 Tokenizer 库☆32Updated 2 years ago
- A PyTorch-based model pruning toolkit for pre-trained language models☆388Updated 2 years ago
- llama inference for tencentpretrain☆99Updated 2 years ago
- A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, …☆47Updated 4 months ago
- CLUEWSC2020: WSC Winograd模式挑战中文版,中文指代消解任务☆79Updated 5 years ago
- Chinese MobileBERT(中文MobileBERT模型)☆98Updated 3 years ago
- 🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具,支持BertSoftmax、BertSpan等模型,开箱即用。☆117Updated last year
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆49Updated 2 years ago
- sentence-transformers to onnx 让sbert模型推理效率更快☆166Updated 3 years ago
- chatglm-6b微调/LORA/PPO/推理, 样本为自动生成的整数/小数加减乘除运算, 可gpu/cpu☆165Updated 2 years ago
- 时间抽取、解析、标准化工具☆56Updated 3 years ago
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC中文语法纠错语料及STG模型☆120Updated last year
- code and data for "CSCD-NS: a Chinese Spelling Check Dataset for Native Speakers"☆82Updated last year
- 基于sentence-transformers实现文本转向量的机器人☆46Updated 3 years ago
- 对话改写介绍文章☆98Updated 2 years ago
- ☆313Updated 2 years ago
- ChatGLM-6B fine-tuning.☆136Updated 2 years ago
- 文本智能校对大赛(Chinese Text Correction)的baseline☆66Updated 3 years ago
- 任务型对话系统(Task-based Dialogue System)☆66Updated 4 years ago
- Correcting Chinese Spelling Errors with Phonetic Pre-training 非官方实现☆39Updated 4 years ago
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)☆96Updated 11 months ago
- QBQTC: 大规模搜索匹配数据集☆85Updated 4 years ago
- CCL 2022 汉语学习者文本纠错评测☆142Updated 3 years ago
- A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.☆124Updated 8 months ago
- 基于模板的文本纠错;Automatically Mining Error Templates for Grammatical Error Correction☆44Updated 3 years ago
- GoGPT:基于Llama/Llama 2训练的中英文增强大模型|Chinese-Llama2☆79Updated 2 years ago