Implements Baichuan-Chat fine-tuning with LoRA, QLoRA, and other fine-tuning methods; runs with a single command.
☆70 · Aug 15, 2023 · Updated 2 years ago
Alternatives and similar repositories for Baichuan-Chat-Tuning
Users who are interested in Baichuan-Chat-Tuning are comparing it to the libraries listed below.
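- A pure C++ cross-platform LLM acceleration library, callable from Python; supports Baichuan, GLM, LLaMA, and MOSS base models; runs ChatGLM-6B-class models smoothly on mobile devices and reaches 10,000+ tokens/s on a single GPU ☆42 · Aug 16, 2023 · Updated 2 years ago
- ☆23 · Jul 17, 2023 · Updated 2 years ago
- Baichuan-13B instruction fine-tuning ☆90 · Jul 14, 2023 · Updated 2 years ago
- An open-source multimodal large language model based on baichuan-7b ☆72 · Dec 7, 2023 · Updated 2 years ago
- OpenAPI specifications => MCP (Model Context Protocol) tools ☆19 · Dec 9, 2024 · Updated last year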
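- ☆11 · Jan 3, 2024 · Updated 2 years ago
- Official Repository for "Modeling Hierarchical Structures with Continuous Recursive Neural Networks" (ICML 2021) ☆12 · Aug 18, 2021 · Updated 4 years ago
- A line-by-line walkthrough of the Baichuan2 code, suitable for beginners ☆211 · Sep 20, 2023 · Updated 2 years ago
- A walkthrough of the official transformers source code. In the era of large AI models, PyTorch and Transformers are the new operating system, and everything else is software running on top of them. ☆16 · Sep 25, 2023 · Updated 2 years ago
- A Paddle reproduction of the paper ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information (ACL 2021) ☆10 · Nov 15, 2021 · Updated 4 years ago
- Instruction fine-tuning of the baichuan-7B large model using QLoRA. ☆23 · Jun 22, 2023 · Updated 2 years ago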
- Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, … ☆6,645 · Oct 24, 2024 · Updated last year
- ☆11 · Jun 4, 2021 · Updated 4 years ago
- The newest version of llama3, with source code explained line by line in Chinese ☆22 · Apr 19, 2024 · Updated 2 years ago
- PyTorch version of the UniLM model ☆26 · Jun 19, 2021 · Updated 4 years ago
- A dataset of news headlines for detecting causalities ☆14 · May 9, 2022 · Updated 4 years ago
- Create Persona dataset from reddit en movie category comment ☆11 · Aug 6, 2021 · Updated 4 years ago
- A repo for updating and debugging Mixtral-7x8B, MoE, ChatGLM3, LLaMa2, Baichuan, Qwen, and other LLM models, including new models mixtral, mixtral 8x7b, … ☆47 · Oct 8, 2025 · Updated 7 months ago
- Named entity recognition using a multi-head approach ☆34 · May 5, 2021 · Updated 5 years ago
- [ICML 2022] HousE: Knowledge Graph Embedding with Householder Parameterization ☆40 · Feb 1, 2022 · Updated 4 years ago
- A series of large language models developed by Baichuan Intelligent Technology ☆4,108 · Nov 8, 2024 · Updated last year
- Zero-shot learning evaluation benchmark, Chinese version ☆59 · Jun 23, 2021 · Updated 4 years ago
- ☆16 · Apr 8, 2025 · Updated last year
- Transformer related optimization, including BERT, GPT ☆17 · Jul 29, 2023 · Updated 2 years ago
- ChatGLM2-6B-Explained ☆36 · Jul 28, 2023 · Updated 2 years ago
- Imitate OpenAI with Local Models ☆90 · Aug 27, 2024 · Updated last year
- Tools for working with the S800 corpus ☆12 · Sep 17, 2020 · Updated 5 years ago
- Attention Is All You Need (https://arxiv.org/abs/1706.03762) ☆10 · Apr 26, 2018 · Updated 8 years ago
- ☆14 · Mar 10, 2020 · Updated 6 years ago
- Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024] ☆99 · Aug 17, 2023 · Updated 2 years ago
- Classifying Chinese product names with a CNN, based on TensorFlow ☆13 · Mar 22, 2019 · Updated 7 years ago
- Solution approaches for common big-data interview and written-test questions at BAT companies, implemented in Python ☆11 · Aug 17, 2015 · Updated 10 years ago
- Mind map for the Andrew Ng Machine Learning course and for popular AI platforms and libraries. ☆12 · Dec 1, 2023 · Updated 2 years ago
- Phase 2 of the Chinese LLaMA-2 & Alpaca-2 large model project, plus 64K long-context models (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models) ☆7,143 · Apr 19, 2026 · Updated 3 weeks ago
- Official repository to release the code and datasets in the paper, "Article Reranking by Memory-enhanced Key Sentence Matching for Detect… ☆19 · Dec 15, 2021 · Updated 4 years ago
- ☆11 · Mar 12, 2024 · Updated 2 years ago
- Wasserstein-Fisher-Rao Embedding: Logical Query Embeddings with Local Comparison and Global Transport (Findings-ACL 2023) ☆13 · May 4, 2023 · Updated 3 years ago
- ☆12 · Jun 14, 2022 · Updated 3 years ago
- A quick and dirty script to call LLaMA.cpp in Python. Supports streaming and interactive mode. ☆13 · Apr 17, 2023 · Updated 3 years ago