实现了Baichuan-Chat微调,Lora、QLora等各种微调方式,一键运行。
☆70Aug 15, 2023Updated 2 years ago
Alternatives and similar repositories for Baichuan-Chat-Tuning
Users that are interested in Baichuan-Chat-Tuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆23Jul 17, 2023Updated 2 years ago
- 基于baichuan-7b的开源多模态大语言模型☆72Dec 7, 2023Updated 2 years ago
- ☆14Mar 28, 2025Updated last year
- The source code of 《 FGN:Fusion Glyph Network for Chinese Named Entity Recognition 》. SOTA Chinese NER method fusing both glyph represne…☆50Mar 22, 2020Updated 6 years ago
- 官方transformers源码解析。AI大模型时代,pytorch、transformer是新操作系统,其他都是运行在其上面的软件。☆16Sep 25, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 基于qlora对baichuan-7B大模型进行指令微调。☆22Jun 22, 2023Updated 2 years ago
- Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…☆6,643Oct 24, 2024Updated last year
- pytorch版unilm模型☆26Jun 19, 2021Updated 4 years ago
- 千问14B和7B的逐行解释☆64Sep 26, 2023Updated 2 years ago
- Create Persona dataset from reddit en movie category comment☆11Aug 6, 2021Updated 4 years ago
- A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, …☆47Oct 8, 2025Updated 7 months ago
- 使用多头的思想来进行命名实体识别☆34May 5, 2021Updated 5 years ago
- This repository contains the data and code for the paper "SideControl: Controlled Open-domain Dialogue Generation via Additive Side Netwo…☆12Dec 1, 2021Updated 4 years ago
- [ICML 2022] HousE: Knowledge Graph Embedding with Householder Parameterization☆40Feb 1, 2022Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- 🎬 豆瓣电影评论摘要生成☆10Aug 8, 2016Updated 9 years ago
- A series of large language models developed by Baichuan Intelligent Technology☆4,102Nov 8, 2024Updated last year
- 零样本学习测评基准,中文版☆59Jun 23, 2021Updated 4 years ago
- ☆17Apr 8, 2025Updated last year
- ChatGLM2-6B-Explained☆36Jul 28, 2023Updated 2 years ago
- This is the source code of our paper PALT in EMNLP2022.☆13Nov 19, 2022Updated 3 years ago
- A 13B large language model developed by Baichuan Intelligent Technology☆2,931Sep 6, 2023Updated 2 years ago
- CIKM 2021 Full Paper: FedMatch: Federated Learning Over Heterogeneous Question Answering Data☆12Dec 14, 2021Updated 4 years ago
- ☆12Aug 24, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Tools for working with the S800 corpus☆12Sep 17, 2020Updated 5 years ago
- Imitate OpenAI with Local Models☆91Aug 27, 2024Updated last year
- ☆14Mar 10, 2020Updated 6 years ago
- Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024]☆100Aug 17, 2023Updated 2 years ago
- CNN对 中文商品名称进行分类,基于Tensorflow☆13Mar 22, 2019Updated 7 years ago
- Get a nicely-chunked local copy of the biomedical literature (to use for other projects)!☆15Jun 10, 2024Updated last year
- 中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)☆7,139Apr 19, 2026Updated last month
- Official repository to release the code and datasets in the paper, "Article Reranking by Memory-enhanced Key Sentence Matching for Detect…☆19Dec 15, 2021Updated 4 years ago
- ☆11Mar 12, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆12Jun 14, 2022Updated 3 years ago
- A quick and dirty script to call LLaMA.cpp in Python. Supports streaming and interactive mode.☆13Apr 17, 2023Updated 3 years ago
- ☆19Aug 9, 2024Updated last year
- paper notes about joint extraction of entity and relation☆13Jul 22, 2019Updated 6 years ago
- Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation, ICML 2024☆23Jun 26, 2024Updated last year
- 🩹Editing large language models within 10 seconds⚡☆1,364Aug 13, 2023Updated 2 years ago
- ☆35Dec 23, 2022Updated 3 years ago