Train a Chinese vocabulary with BPE in sentencepiece and use it in transformers.
☆120 Jun 24, 2023 Updated 2 years ago
Alternatives and similar repositories for sentencepiece_chinese_bpe
Users that are interested in sentencepiece_chinese_bpe are comparing it to the libraries listed below.
- ☆310 Apr 6, 2023 Updated 3 years ago
- ☆11 May 2, 2023 Updated 2 years ago
- How to train an LLM tokenizer ☆152 Jul 13, 2023 Updated 2 years ago
- Fine-tuning large models with instruction tuning. ☆11 Jun 28, 2023 Updated 2 years ago
- ☆16 Mar 22, 2023 Updated 3 years ago
- Debug DeepSpeed-Chat step by step in an IDE ☆10 Apr 17, 2023 Updated 2 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for TensorFlow. ☆12 Jul 5, 2021 Updated 4 years ago
- Instruction fine-tuning of the BLOOM model ☆24 Jun 15, 2023 Updated 2 years ago
- Tonghuashun Algorithm Challenge Platform: [September-October bimonthly competition] Text semantic matching with cross-domain transfer ☆11 Oct 28, 2021 Updated 4 years ago
- A PyTorch-based scaffold for Chinese text classification, with a comparison of commonly used models. ☆18 Apr 23, 2021 Updated 4 years ago
- Model to predict kinase-ligand pKi values. ☆12 Jul 6, 2023 Updated 2 years ago
- CMeEE/CBLUE/NER entity recognition ☆136 Apr 11, 2022 Updated 4 years ago
- Reproduction of Chinese text correction with Soft-Masked BERT ☆21 May 20, 2021 Updated 4 years ago
- Firefly Chinese LLaMA-2 large model; supports incremental pre-training of Baichuan2, Llama2, Llama, Falcon, Qwen, Baichuan, InternLM, Bloom, and other large models ☆416 Oct 21, 2023 Updated 2 years ago
- Chinese LLaMA-2 & Alpaca-2 LLMs (phase two of the project), with 64K long-context models ☆7,154 Jul 15, 2025 Updated 9 months ago
- ☆12 Oct 17, 2024 Updated last year
- A simple task-oriented dialogue system ☆18 May 17, 2019 Updated 6 years ago
- Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment ☆18,952 Jul 15, 2025 Updated 9 months ago
- One codebase for instruction fine-tuning large models ☆39 Aug 1, 2023 Updated 2 years ago
- The code for the paper "Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning" (CVPR'25). ☆15 Sep 25, 2025 Updated 6 months ago
- A repository for pre-training from scratch and then SFT-ing a small-parameter Chinese LLaMa2; it runs on a single 24G GPU and yields a chat-llama2 with basic Chinese question-answering ability. ☆2,903 May 21, 2024 Updated last year
- Implementation for paper "A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation" ☆24 Mar 1, 2020 Updated 6 years ago
- This is an unofficial implementation of universal melgan according to https://arxiv.org/abs/2011.09631 ☆23 Aug 15, 2022 Updated 3 years ago
- ChatGLM-6B instruction learning | instruction data | Instruct ☆653 Apr 10, 2023 Updated 3 years ago
- Exploring the fine-tuning performance of Chinese instruct data on ChatGLM and LLaMA ☆389 Apr 4, 2023 Updated 3 years ago
- [NeurIPS2024] BoostAdapter: Improving Test-Time Adaptation via Regional Bootstrapping ☆19 Feb 28, 2026 Updated last month
- A code library for training acoustic models ☆27 May 20, 2020 Updated 5 years ago
- Analysis of the Chinese cognitive abilities of language models ☆235 Sep 9, 2023 Updated 2 years ago
- Framework for Detection Evaluation (F4DE) : set of evaluation tools for detection evaluations and for specific NIST-coordinated evaluatio… ☆25 Jul 6, 2017 Updated 8 years ago
- 2024 FinVolution Global Data Science Competition-9th baseline ☆20 May 17, 2024 Updated last year
- Named entity recognition using a multi-head approach ☆34 May 5, 2021 Updated 4 years ago
- LAVIS - A One-stop Library for Language-Vision Intelligence ☆10 Apr 18, 2023 Updated 2 years ago
- Implementation of an NER model on a Chinese dataset. ☆74 Apr 8, 2023 Updated 3 years ago
- This repository is the official implementation of unimodal aggregation (UMA) for automatic speech recognition (ASR). ☆35 Dec 17, 2024 Updated last year
- Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llma2, … ☆6,649 Oct 24, 2024 Updated last year
- CHIP2020 Task 3: term normalization ☆31 Oct 23, 2020 Updated 5 years ago
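- New word discovery in the novel 斗破苍穹 (Battle Through the Heavens) ☆13 May 12, 2022 Updated 3 years ago
- Welcome to the "LLM-travel" repository! Explore the mysteries of large language models (LLMs) 🚀. Dedicated to deeply understanding, discussing, and implementing the techniques, principles, and applications related to large models. ☆378 Jul 21, 2024 Updated last year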
- A moment is eternity ☆13 Feb 26, 2020 Updated 6 years ago