Datastory-CN / DataStoryLLMBenchmark
☆10Updated last year
Related projects ⓘ
Alternatives and complementary repositories for DataStoryLLMBenchmark
- CoSENT、STS、SentenceBERT☆162Updated last year
- GlobalPointer的优化版/NER实体识别☆112Updated 2 years ago
- [SIGIR 2022] Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval☆171Updated last year
- Baichuan-13B 指令微调☆89Updated last year
- chinese document classification of layoutlmv3 and layoutxlm☆41Updated 2 years ago
- 阿里天池: 2023全球智能汽车AI挑战赛——赛道一:AI大模型检索问答 baseline 80+☆76Updated 10 months ago
- 该项目是为了使用layoutlmv3针对中文图片训练和推理。 其中主要解决三个问题: 1.数据标准化成可以的训练数据集格式 2.layoutlmv3-base-chinese 分词修改 2.超过512长度的文本切分和滑窗操作☆33Updated 2 months ago
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆45Updated 5 months ago
- ☆32Updated 3 years ago
- pytorch Efficient GlobalPointer☆51Updated 2 years ago
- ☆26Updated 3 weeks ago
- easy-bert是一个中文NLP工具,提供诸多bert变体调用和调参方法,极速上手;清晰的设计和代码注释,也很适合学习☆72Updated 2 years ago
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆106Updated last year
- 真 · “Deep Learning for Humans”☆140Updated 2 years ago
- 受到self-instruct启发,除了通用LLM还能做垂直领域的小LLM实现定制效果,通过GPT获得question和answer来作为训练数据☆12Updated last year
- 中文 Instruction tuning datasets☆118Updated 7 months ago
- 基于pytorch的百度UIE命名实体识别。☆53Updated last year
- text embedding☆139Updated last year
- baichuan LLM surpervised finetune by lora☆60Updated last year
- sentence-transformers to onnx 让sbert模型推理效率更快☆162Updated 2 years ago
- ☆15Updated last year
- Grammar correct project based Tencent's paper(Sequence to Action)☆16Updated 2 years ago
- Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.☆164Updated 2 years ago
- ChatGLM-6B fine-tuning.☆135Updated last year
- ☆57Updated last year
- 使用sentencepiece中BPE训练中文词表,并在transformers中进行使用。☆110Updated last year
- A Multi-modal Model Chinese Spell Checker Released on ACL2021.☆154Updated last year
- 中文文本纠错相关的论文、比赛和工具。☆49Updated 4 months ago