Alpaca Chinese Dataset -- Chinese instruction fine-tuning dataset
☆221 · Oct 6, 2024 · Updated last year
Alternatives and similar repositories for alpaca-chinese-dataset
Users that are interested in alpaca-chinese-dataset are comparing it to the libraries listed below.
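- LLM model connection LangChain RAG Connection to Streamlit Web ☆14 · Oct 22, 2023 · Updated 2 years ago
- Chinese "Complete Poetry Collection": the most comprehensive and systematic Chinese poetry dataset to date, with unified data modeling. ☆41 · Jan 6, 2026 · Updated 4 months ago
- CCL2024 Chinese Essay Rhetoric Recognition and Understanding ☆17 · Oct 1, 2024 · Updated last year
- Chinese LLM fine-tuning (LLM-SFT), with the math instruction dataset MWP-Instruct; supported models (ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B); supports (LoRA, QLoRA, DeepSpeed, UI, TensorboardX); supports (微… ☆216 · May 17, 2024 · Updated 2 years ago
- PptGPT is an intelligent slide-generation plugin that draws on a personal knowledge base to generate PPT slide content and original illustrations closely matched to thesis defense, speech, and report scenarios; it can also revise and optimize PPT content through its polishing and translation features. ☆15 · Jul 28, 2024 · Updated last year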
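- Firefly: an LLM training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, … ☆6,643 · Oct 24, 2024 · Updated last year
- A roundup of news on data competitions in China and abroad ☆18 · Nov 6, 2021 · Updated 4 years ago
- Llama3, Chinese post-trained version ☆4,152 · Feb 21, 2026 · Updated 3 months ago
- A manually refined Chinese dialogue dataset and a piece of chatglm fine-tuning code ☆1,191 · May 3, 2025 · Updated last year
- KWS demo based on CTC prefix beam search. ☆18 · Oct 21, 2023 · Updated 2 years ago
- Large-scale Pre-training Corpus for Chinese (100G Chinese pre-training corpus) ☆1,010 · Feb 6, 2026 · Updated 3 months ago
- An LLM fine-tuned from the InternLM2-chat-7 base model on datasets covering common Chinese herbal medicines and the like. A Traditional Chinese Medicine chat assistant. ☆18 · Feb 29, 2024 · Updated 2 years ago
- A Qwen-VL model that can be successfully fine-tuned with LoRA ☆16 · Oct 27, 2023 · Updated 2 years ago
- Llama Chinese community: aggregates the latest Llama learning materials in real time and aims to build the best open-source ecosystem for Chinese Llama LLMs; fully open source and available for commercial use ☆14,712 · Apr 6, 2025 · Updated last year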
- Chinese Mixtral mixture-of-experts large models (Chinese Mixtral MoE LLMs) ☆612 · Apr 19, 2026 · Updated last month
- A simple Rasa UI ☆14 · Jul 13, 2020 · Updated 5 years ago
- 360LayoutAnalysis, a series of Document Analysis Models and Datasets developed by 360 AI Research Institute ☆306 · Sep 10, 2024 · Updated last year
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | Continued pre-training to improve … ☆40 · May 31, 2025 · Updated last year
- Full-parameter, LoRA, and QLoRA fine-tuning of llama3. ☆220 · Oct 4, 2024 · Updated last year
- Phase 2 of the Chinese LLaMA-2 & Alpaca-2 LLM project, plus 64K ultra-long-context models (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models) ☆7,139 · Apr 19, 2026 · Updated last month
- Acoustic Scene Classification using transfer learning on VGGish pre-trained model ☆11 · Jan 3, 2018 · Updated 8 years ago
- We Speech Transcript based on LLM, in 300 lines of code. ☆184 · Jun 20, 2025 · Updated 11 months ago
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) ☆71,697 · Updated this week
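- 2020 Xiamen International Bank "Digital Innovation Finance Cup" modeling competition: merit award solution ☆11 · Feb 2, 2021 · Updated 5 years ago
- Phase 3 of the Chinese Alpaca LLM project (Chinese Llama-3 LLMs), developed from Meta Llama 3 ☆1,977 · Apr 19, 2026 · Updated last month
- Instruction fine-tuning of the baichuan-7B LLM using QLoRA. ☆22 · Jun 22, 2023 · Updated 2 years ago
- BELLE: Be Everyone's Large Language model Engine (an open-source Chinese dialogue LLM) ☆8,276 · Oct 16, 2024 · Updated last year
- Supervised fine-tuning (SFT) of the chatglm3base model ☆80 · Nov 5, 2023 · Updated 2 years ago
- LLM applications: RAG, NL2SQL, chatbots, pre-training, MoE (mixture-of-experts) models, fine-tuning, reinforcement learning, Tianchi data competitions ☆76 · Feb 10, 2025 · Updated last year
- Commonsense Reasoning and Moral Understanding Evaluation in Children's Stories (CRMU) ☆18 · Oct 22, 2024 · Updated last year
- We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tunin… ☆2,796 · Dec 12, 2023 · Updated 2 years ago
- A curated collection of open-source Chinese LLMs, focusing on smaller models that can be privately deployed and are cheaper to train; covers base models, vertical-domain fine-tuning and applications, datasets, tutorials, and more. ☆22,581 · May 10, 2026 · Updated 3 weeks ago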
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL… ☆14,338 · Updated this week
- Building a MiniLLM from 0 to 1 (pretrain + sft + dpo, practice in progress) ☆552 · Mar 23, 2025 · Updated last year
- A Next-Generation Training Engine Built for Ultra-Large MoE Models ☆5,138 · Updated this week
- Chinese named entity recognition with a pytorch-based TPLinker_plus ☆19 · May 14, 2023 · Updated 3 years ago
- KDD 2024 AQA competition 2nd place solution ☆12 · Jul 21, 2024 · Updated last year
- Train a 1B LLM with 1T tokens from scratch as an individual ☆803 · Apr 27, 2025 · Updated last year
- Fine-tuning for specific downstream tasks based on the ChatGLM-6B, ChatGLM2-6B, and ChatGLM3-6B models, covering Freeze, LoRA, P-tuning, full-parameter fine-tuning, and more ☆2,779 · Dec 12, 2023 · Updated 2 years ago