Alpaca Chinese Dataset -- 中文指令微调数据集
☆220Oct 6, 2024Updated last year
Alternatives and similar repositories for alpaca-chinese-dataset
Users that are interested in alpaca-chinese-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 中文《诗歌总集》,距今为止最全面,最系统的中文诗词数据集,统一数据建模.☆40Jan 6, 2026Updated 4 months ago
- CCL2024 Chinese Essay Rhetoric Recognition and Understanding☆17Oct 1, 2024Updated last year
- 中文大模型微调(LLM-SFT), 数学指令数据集MWP-Instruct, 支持模型(ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B), 支持(LoRA, QLoRA, DeepSpeed, UI, TensorboardX), 支持(微…☆216May 17, 2024Updated last year
- PptGPT是一款智能幻灯片生成插件,可以结合个人知识库生成精准匹配答辩、演讲、汇报场景的PPT幻灯片内容及原创配图,也可以通过润色及翻译功能修改优化PPT内容。☆15Jul 28, 2024Updated last year
- Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…☆6,643Oct 24, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- 国内外数据竞赛资讯整理☆18Nov 6, 2021Updated 4 years ago
- KWS demo based on CTC prefix beam search.☆17Oct 21, 2023Updated 2 years ago
- Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。☆4,161Feb 21, 2026Updated 2 months ago
- 人工精调的中文对话数据集和一段chatglm的微调代码☆1,193May 3, 2025Updated last year
- Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料☆1,004Feb 6, 2026Updated 3 months ago
- 以InternLM2-chat-7为基座模型,以常用中药等为数据集,微调的大模型。中医聊天小助手。☆18Feb 29, 2024Updated 2 years ago
- 可以成功Lora微调的Qwen-VL模型☆16Oct 27, 2023Updated 2 years ago
- Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用☆14,723Apr 6, 2025Updated last year
- 中文Mixtral混合专家大模型(Chinese Mixtral MoE LLMs)☆611Apr 19, 2026Updated 3 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A simple Rasa UI☆14Jul 13, 2020Updated 5 years ago
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | 继续预训练提升 …☆40May 31, 2025Updated 11 months ago
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆307Sep 10, 2024Updated last year
- 对llama3进行全参微调、lora微调以及qlora微调。☆219Oct 4, 2024Updated last year
- 中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)☆7,143Apr 19, 2026Updated 3 weeks ago
- Acoustic Scene Classification using transfer learning on VGGish pre-trained model☆11Jan 3, 2018Updated 8 years ago
- We Speech Transcript based on LLM, in 300 lines of code.☆185Jun 20, 2025Updated 10 months ago
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆70,969May 3, 2026Updated last week
- 2020厦门国际银行数创金融杯建模大赛-优胜奖方案☆11Feb 2, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3☆1,969Apr 19, 2026Updated 3 weeks ago
- 基于qlora对baichuan-7B大模型进行指令微调。☆23Jun 22, 2023Updated 2 years ago
- BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)☆8,279Oct 16, 2024Updated last year
- 拼音转汉字, convert pinyin to 汉字 using deep networks☆23Sep 18, 2020Updated 5 years ago
- chatglm3base模型的有监督微调SFT☆80Nov 5, 2023Updated 2 years ago
- 大语言模型应用:RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛☆76Feb 10, 2025Updated last year
- 儿童故事常识推理与寓意理解评测(Commonsense Reasoning and Moral Understanding Evaluation in Children's Stories,CRMU)☆18Oct 22, 2024Updated last year
- 结合知识图谱做的有关诗词的问答demo☆11Mar 11, 2020Updated 6 years ago
- We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tunin…☆2,799Dec 12, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL…☆14,032Updated this week
- 整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。☆22,563Apr 23, 2026Updated 2 weeks ago
- 从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)☆549Mar 23, 2025Updated last year
- ☆15Jun 18, 2021Updated 4 years ago
- A Next-Generation Training Engine Built for Ultra-Large MoE Models☆5,129May 5, 2026Updated last week
- 基于pytorch的TPLinker_plus进行中文命名实体识别☆19May 14, 2023Updated 2 years ago
- KDD 2024 AQA competition 2nd place solution☆12Jul 21, 2024Updated last year