open-chinese / alpaca-chinese-datasetView external linksLinks
Alpaca Chinese Dataset -- 中文指令微调数据集
☆216Oct 6, 2024Updated last year
Alternatives and similar repositories for alpaca-chinese-dataset
Users that are interested in alpaca-chinese-dataset are comparing it to the libraries listed below
Sorting:
- LLM model connection LangChain RAG Connection to Streamlit Web☆14Oct 22, 2023Updated 2 years ago
- CCL2024 Chinese Essay Rhetoric Recognition and Understanding☆17Oct 1, 2024Updated last year
- 国内外数据竞赛资讯整理☆18Nov 6, 2021Updated 4 years ago
- Implementing BERT + CRF with PyTorch for Chinese NER.☆10Mar 7, 2022Updated 3 years ago
- Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…☆6,635Oct 24, 2024Updated last year
- 中文大模型微调(LLM-SFT), 数学指令数据集MWP-Instruct, 支持模型(ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B), 支持(LoRA, QLoRA, DeepSpeed, UI, TensorboardX), 支持(微…☆217May 17, 2024Updated last year
- This plugin provides tools to extract text from a document using the Azure AI Document Intelligence service.☆12Jan 17, 2025Updated last year
- 2020厦门国际银行数创金融杯建模大赛-优胜奖方案☆11Feb 2, 2021Updated 5 years ago
- 利用BERT预训练模型进行文本生成,可用于对话、摘要、问题生成等任务。 目前支持策略,词表的插入和删除、自定义Character Embedding、随机词替换等☆10Jun 1, 2022Updated 3 years ago
- ☆13Mar 16, 2025Updated 11 months ago
- PptGPT是一款智能幻灯片生成插件,可以结合个人知识库生成精准匹配答辩、演讲、汇报场景的PPT幻灯片内容及原创配图,也可以通过润色及翻译功能修改优化PPT内容。☆14Jul 28, 2024Updated last year
- Implementation for EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models"☆12Mar 19, 2024Updated last year
- 从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)☆527Mar 23, 2025Updated 10 months ago
- 大语言模型应用:RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛☆75Feb 10, 2025Updated last year
- Train a 1B LLM with 1T tokens from scratch by personal☆788Apr 27, 2025Updated 9 months ago
- ☆17Aug 17, 2024Updated last year
- Dify Streamlit Chat App☆14Aug 31, 2024Updated last year
- 教员选集(5册),常读常新。☆13Jan 18, 2025Updated last year
- 🤗 HF Downloader (Hugging Face Downloader) 📦 A user-friendly GUI tool for downloading Hugging Face resources with enhanced connectivity…☆13Jan 5, 2025Updated last year
- KDD 2024 AQA competition 2nd place solution☆12Jul 21, 2024Updated last year
- Evolve diffusion models by merging.☆13Jun 15, 2024Updated last year
- 基于Qwen2.5模型、使用DISC-Law-SFT-Pair数据集微调的法律大模型☆10Dec 29, 2024Updated last year
- 人工精调的中文对话数据集和一段chatglm的微调代码☆1,195May 3, 2025Updated 9 months ago
- 雅意信息抽取大模型:在百万级人工构造的高质量信息抽取数据上进行指令微调,由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)☆315Aug 8, 2024Updated last year
- Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。☆4,151Jan 6, 2026Updated last month
- Copy the MLP of llama3 8 times as 8 experts , created a router with random initialization,add load balancing loss to construct an 8x8b Mo…☆27Jul 1, 2024Updated last year
- Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料☆997Feb 6, 2026Updated last week
- 中文Mixtral混合专家大模型(Chinese Mixtral MoE LLMs)☆609Apr 30, 2024Updated last year
- Recursive Abstractive Processing for Tree-Organized Retrieval☆10May 30, 2024Updated last year
- 9th solution☆11Oct 11, 2022Updated 3 years ago
- ☆12Jul 24, 2024Updated last year
- ☆12Jan 5, 2023Updated 3 years ago
- 一个基于langchain实现RAG的简单示例☆606Jan 19, 2026Updated 3 weeks ago
- 大模型检索增强生成技术最佳实践。☆88Sep 4, 2024Updated last year
- An easy-to-use framework for modular RAG☆433Updated this week
- Automatic prompt optimization framework for multi-step agent tasks.☆36Nov 12, 2024Updated last year
- We Speech Transcript based on LLM, in 300 lines of code.☆183Jun 20, 2025Updated 7 months ago
- WiNGPT是一个基于GPT的医疗垂直领域大模型,旨在将专业的医学知识、医疗信息、数据融会贯通,为医疗行业提供智能化的医疗问答、诊断支持和医学知识等信息服务,提高诊疗效率和医疗服务质量。☆421Nov 28, 2024Updated last year
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | 继续预训练提升 …☆37May 31, 2025Updated 8 months ago