Instruction fine-tuning of the baichuan-7B large model using QLoRA.
☆22Jun 22, 2023Updated 2 years ago
Alternatives and similar repositories for baichuan-Qlora-Tuning
Users that are interested in baichuan-Qlora-Tuning are comparing it to the libraries listed below.
- Baichuan-13B instruction fine-tuning☆88Jul 14, 2023Updated 2 years ago
- Finetune baichuan pretrained model with QLora method☆16Jul 13, 2023Updated 2 years ago
- Fine-tuning ChatGLM with LoRA.☆49Jun 26, 2023Updated 2 years ago
- A poetry question-answering demo built on a knowledge graph☆11Mar 11, 2020Updated 6 years ago
- Fine-tuning Chinese large language models with QLoRA, including ChatGLM, Chinese-LLaMA-Alpaca, and BELLE☆88Jun 27, 2023Updated 2 years ago
- word2vec demo for #hourofcode using gensim☆22Jan 17, 2015Updated 11 years ago
- An automated pipeline for scraping, processing, and visualizing medical Q&A data to build high-quality datasets. Includes a comprehensive…☆24Dec 24, 2024Updated last year
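- HeartLink is an empathetic psychological large model, instruction-tuned from a `Large Language Model` on a large purpose-built empathetic Q&A dataset. During a conversation it can perceive the user's emotions and current experiences and, drawing on rich psychological knowledge, give empathetic replies that understand, comfort, and support the user. Replies come with emoji…☆44Nov 13, 2024Updated last year
- This project focuses on recognizing user text in intelligent dialogue scenarios, automatically determining the emotion category and reporting the corresponding accuracy. It can be widely applied to sentiment analysis of social media comments, emotion analysis for intelligent customer service, and similar scenarios, serving as an emotional support tool that helps users free themselves from their emotions. After multiple rounds of prompt improvement, the GPT model's final recognition accuracy exceeds the human baseline.☆11Jul 25, 2023Updated 2 years ago
- Fine-tuning and knowledge distillation of qwen2.5☆17Dec 24, 2024Updated last year
- Example code for fine-tuning multimodal large language models with LLaMA-Factory☆58Sep 8, 2024Updated last year
- Fine-tuning of an insurance-knowledge large model based on internlm-chat-7b☆20Apr 26, 2024Updated 2 years ago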
- ☆10Aug 16, 2022Updated 3 years ago
- Constrained learning using boxes for event-event relation extraction☆12Aug 5, 2022Updated 3 years ago
- Code of the paper “A Fin-BERT-based Event Extraction Method for Chinese Financial Domain”☆12May 22, 2024Updated 2 years ago
- A hand-written mini version of Tomcat that supports access to static and dynamic resources.☆10Dec 27, 2020Updated 5 years ago
- TECHS: Temporal Logical Graph Networks for Explainable Extrapolation Reasoning☆10Jan 16, 2024Updated 2 years ago
- A benchmark for assessing the strength of causal relationships between real-world events (EMNLP 2023).☆15Nov 23, 2023Updated 2 years ago
- MoM: Mixtures of Scenario-Aware Document Memories for Retrieval-Augmented Generation Systems☆44Oct 17, 2025Updated 7 months ago
- Uses ACE2005 to build an event knowledge graph with events and entities as nodes, for intelligent question answering☆18Feb 29, 2020Updated 6 years ago
- Convert DeepBind models to Keras☆12Jul 15, 2018Updated 7 years ago
- A collection of common Python tools: Traditional/Simplified Chinese conversion; word frequency counting;☆20May 8, 2017Updated 9 years ago
- The project is based on YoloV3 and PyTorch to detect the national flag in the picture.☆11Aug 3, 2021Updated 4 years ago
- LoRA scripts for training a Chinese DeepSeek R1 large model☆13Mar 20, 2025Updated last year
- Chinese Emergency Corpus, from the Semantic Intelligence Laboratory, Shanghai University☆12Apr 21, 2021Updated 5 years ago
- Dawn (曙光) h5 SDK, part of the https://github.com/eventtracing/dawn project☆15Mar 15, 2023Updated 3 years ago
- ☆22Jun 10, 2025Updated 11 months ago
- Therapixel solution of 2017's kaggle challenge on lung cancer detection☆14Feb 9, 2018Updated 8 years ago
- This repo offers advanced tutorials for LLMs, BERT-based models, and multimodal models, covering fine-tuning, quantization, vocabulary ex…☆24May 5, 2025Updated last year
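- Local knowledge base question answering for sophon bm1684x, based on Langchain and language models such as ChatGLM☆13Jun 5, 2024Updated last year
- Xiaohongshu web assistant: a software assistant that supports pinning to the desktop for small-window browsing and reading, simultaneous login of multiple accounts to boost user activity, and batch automated downloading of images, notes, and videos, giving users a broader visual experience and more enjoyable interaction when reading Xiaohongshu notes.☆10Jul 22, 2024Updated last year
- From jieba word segmentation to BERT-wwm, a step-by-step introduction to the world of Chinese NLP☆15Sep 1, 2022Updated 3 years ago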
- A Web Spider for Weibo(Chinese Twitter)☆18Aug 12, 2015Updated 10 years ago
- Combination of Yolov8 and Swin-Transformer☆15Sep 27, 2025Updated 8 months ago
- ☆14Jun 20, 2022Updated 3 years ago
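- Ziya-LLaMA-13B is IDEA's 13-billion-parameter large-scale pretrained model based on LLaMa, with capabilities in translation, programming, text classification, information extraction, summarization, copywriting, commonsense question answering, and mathematical calculation. The Ziya general-purpose large model has now completed a three-stage training process: large-scale pretraining, multi-task supervised fine-tuning, and learning from human feedback. This article is mainly for Ziya-…☆46Jun 9, 2023Updated 2 years ago
- Multimodal emotion recognition☆30Aug 11, 2023Updated 2 years ago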
- Code for COLING 2022 paper "FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition"☆15Jan 15, 2023Updated 3 years ago
- My pytorch implementation of the model described in the paper **Hierarchical Attention Networks for Document Classification** [paper](htt…☆10Mar 22, 2019Updated 7 years ago