zhenlohuang / awesome-chinese-llm
Awesome Chinese LLM: A curated list of Chinese Large Language Model datasets and model resources
☆164Jun 10, 2024Updated last year
Alternatives and similar repositories for awesome-chinese-llm
Users who are interested in awesome-chinese-llm are comparing it to the libraries listed below
- This is the implementation code for the paper "Trainable Undersampling for Class-Imbalance Learning" published in AAAI2019☆16Mar 17, 2019Updated 6 years ago
- ☆12Mar 29, 2023Updated 2 years ago
- High-performance RMSNorm implementation using SM core storage (registers and shared memory)☆26Jan 22, 2026Updated 3 weeks ago
- ICLR 2021: Noise against noise: stochastic label noise helps combat inherent label noise☆15May 1, 2021Updated 4 years ago
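- This repository mainly records search engine study notes for NLP algorithm engineers☆13Apr 9, 2022Updated 3 years ago
- A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheaper to train, including base models, vertical-domain fine-tuning and applications, datasets, and tutorials.☆22,214May 19, 2025Updated 8 months ago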
- A simple Rasa UI☆14Jul 13, 2020Updated 5 years ago
- This is the code repo for the paper "UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction"☆15Aug 10, 2023Updated 2 years ago
- Turn Dify API into OpenAI API schema☆17Aug 16, 2024Updated last year
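- MNBVC (Massive Never-ending BT Vast Chinese corpus): an ultra-large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also various niche cultures and even "Martian script" (火星文) data. The MNBVC dataset includes news, essays, novels, books, magazines…☆4,120Jan 31, 2026Updated 2 weeks ago
- Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models☆5,516Updated this week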
- Code for the KDD 2022 paper "Interpreting Trajectories from Multiple Views: A Hierarchical Self-Attention Network for Estimating the Time…☆18May 29, 2022Updated 3 years ago
- ☆35Dec 5, 2021Updated 4 years ago
- PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation [NeurIPS 2025]☆18Oct 11, 2025Updated 4 months ago
- ☆18Apr 28, 2022Updated 3 years ago
- A collection of AGI learning resources (mainly covering LLM and AIGC), continuously updated...☆476Feb 11, 2026Updated last week
- Implementations of different ML tasks on the Kaggle platform with GPUs.☆26Jan 27, 2026Updated 3 weeks ago
- Prompting Large Language Models for Zero-Shot Clinical Prediction with Structured Longitudinal Electronic Health Record Data☆28Jan 17, 2024Updated 2 years ago
- Multimodal short video classification task, integrating video, image, audio, and text modalities for short video classification☆19Mar 12, 2020Updated 5 years ago
- Towards Achieving Adversarial Robustness by Enforcing Feature Consistency Across Bit Planes☆23Jun 14, 2020Updated 5 years ago
- SAIS: Supervising and Augmenting Intermediate Steps for Document-Level Relation Extraction☆20Oct 10, 2022Updated 3 years ago
- Baseline for GAIIC2022 product title entity recognition, implemented with GlobalPointer; online score 0.80349☆52Apr 9, 2022Updated 3 years ago
- This is the repository to reproduce the experiments of the IJCAI 2020 paper "Metric Learning in Optimal Transport for Domain Adaptation"☆23Jun 9, 2020Updated 5 years ago
- Exploiting Unlabeled Data for Target-Oriented Opinion Words Extraction☆24Sep 30, 2022Updated 3 years ago
- [IEEE TITS 2024] Activity-aware human mobility prediction with hierarchical graph attention recurrent network.☆25Jan 19, 2025Updated last year
- ☆25Aug 2, 2024Updated last year
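- "CPlusPlus Programming Language Fundamentals", also known as the "C++ Knowledge Tree" (C加加知识树), presents all the essential C++ fundamentals for C++ practitioners in the form of a tree-shaped mind map.☆26Jun 21, 2020Updated 5 years ago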
- [ECCV2022] Optimization over Disentangled Encoding: Unsupervised Cross-Domain Point Cloud Completion via Occlusion Factor Manipulation☆30Nov 21, 2022Updated 3 years ago
- from MHA, MQA, GQA to MLA by 苏剑林, with code☆42Feb 19, 2025Updated 11 months ago
- Travel Time Prediction Based on Tensor Decomposition and Graph Embedding☆29Dec 25, 2020Updated 5 years ago
- Firefly: a large model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, …☆6,635Oct 24, 2024Updated last year
- ASCEND Chinese-English code-switching dataset☆30Jul 12, 2022Updated 3 years ago
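- Y-Agent Studio is an agent development suite for enterprise-grade applications, with Y-Agent as its core module. It includes features that vertical domains badly need, such as agent orchestration, RAG, workflow logs, unit testing, workflow testing, and corpus production. Agent orchestration can support both multi-agent collaboration and hybrid workflow orchestration within the same workflow…☆25Oct 4, 2025Updated 4 months ago
- 【丫丫】 is an attempt at instruction fine-tuning with LoRA, using Moss as the base model. It was mainly completed by 黄泓森 and 陈启源 @ Central China Normal University. It is also a subproject of the 【骆驼】 open-source Chinese large language model.☆30Apr 22, 2023Updated 2 years ago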
- Taichi Course Homework Template☆35Oct 27, 2021Updated 4 years ago
- 【LLMs九层妖塔】 (LLMs Nine-Story Demon Tower): shares LLMs in natural language processing (ChatGLM, Chinese-LLaMA-Alpaca, Vicuna, LLaMA, GPT4ALL, etc.), information retrieval (langchain), speech synthesis, speech recognition, multimodal, and other fields (Stable Diffusion, MiniGP…☆2,150Mar 30, 2024Updated last year
- Large Scale Chinese Corpus for NLP☆9,857Feb 6, 2026Updated last week
- Our 2nd-gen LMM☆34May 22, 2024Updated last year
- This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts".☆44Sep 27, 2025Updated 4 months ago