实现了Baichuan-Chat微调,Lora、QLora等各种微调方式,一键运行。
☆70Aug 15, 2023Updated 2 years ago
Alternatives and similar repositories for Baichuan-Chat-Tuning
Users that are interested in Baichuan-Chat-Tuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 纯c++的全平台llm加速库,支持python调用,支持baichuan, glm, llama, moss基座,手机端流畅运行chatglm-6B级模型单卡可达10000+token / s,☆42Aug 16, 2023Updated 2 years ago
- ☆23Jul 17, 2023Updated 2 years ago
- Baichuan-13B 指令微调☆90Jul 14, 2023Updated 2 years ago
- ☆14Mar 28, 2025Updated last year
- The source code of 《 FGN:Fusion Glyph Network for Chinese Named Entity Recognition 》. SOTA Chinese NER method fusing both glyph represne…☆50Mar 22, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Official Repository for "Modeling Hierarchical Structures with Continuous Recursive Neural Networks" (ICML 2021)☆11Aug 18, 2021Updated 4 years ago
- Baichuan2代码的逐行解 析版本,适合小白☆212Sep 20, 2023Updated 2 years ago
- 官方transformers源码解析。AI大模型时代,pytorch、transformer是新操作系统,其他都是运行在其上面的软件。☆16Sep 25, 2023Updated 2 years ago
- 基于qlora对baichuan-7B大模型进行指令微调。☆23Jun 22, 2023Updated 2 years ago
- Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…☆6,652Oct 24, 2024Updated last year
- the newest version of llama3,source code explained line by line using Chinese☆22Apr 19, 2024Updated last year
- pytorch版unilm模型☆26Jun 19, 2021Updated 4 years ago
- A dataset of news headlines for detecting causalities☆14May 9, 2022Updated 3 years ago
- A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, …☆47Oct 8, 2025Updated 5 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation of SATA Tree-LSTM (Dynamic Compositionality in Recursive Neural Networks with Structure-aware Tag Representations, AAAI 20…☆10Jun 21, 2022Updated 3 years ago
- 使用多头的思想来进行命名实体识别☆34May 5, 2021Updated 4 years ago
- This repository contains the data and code for the paper "SideControl: Controlled Open-domain Dialogue Generation via Additive Side Netwo…☆12Dec 1, 2021Updated 4 years ago
- [ICML 2022] HousE: Knowledge Graph Embedding with Householder Parameterization☆40Feb 1, 2022Updated 4 years ago
- 🎬 豆瓣电影评论摘要生成☆10Aug 8, 2016Updated 9 years ago
- A series of large language models developed by Baichuan Intelligent Technology☆4,117Nov 8, 2024Updated last year
- ☆16Apr 8, 2025Updated 11 months ago
- ChatGLM2-6B-Explained☆36Jul 28, 2023Updated 2 years ago
- A 13B large language model developed by Baichuan Intelligent Technology☆2,943Sep 6, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Imitate OpenAI with Local Models☆89Aug 27, 2024Updated last year
- Tools for working with the S800 corpus☆12Sep 17, 2020Updated 5 years ago
- Attention Is All You Need (https://arxiv.org/abs/1706.03762)☆10Apr 26, 2018Updated 7 years ago
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- 命名实体识别☆12Dec 21, 2020Updated 5 years ago
- 中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)☆7,161Jul 15, 2025Updated 8 months ago
- ☆11Mar 12, 2024Updated 2 years ago
- Official repository to release the code and datasets in the paper, "Article Reranking by Memory-enhanced Key Sentence Matching for Detect…☆19Dec 15, 2021Updated 4 years ago
- Wasserstein-Fisher-Rao Embedding: Logical Query Embeddings with Local Comparison and Global Transport (Findings-ACL 2023)☆13May 4, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆12Jun 14, 2022Updated 3 years ago
- paper notes about joint extraction of entity and relation☆13Jul 22, 2019Updated 6 years ago
- baichuan-7B 微调 C++ 面试大模型☆14Jul 8, 2023Updated 2 years ago
- Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation, ICML 2024☆22Jun 26, 2024Updated last year
- 🩹Editing large language models within 10 seconds⚡☆1,359Aug 13, 2023Updated 2 years ago
- ☆35Dec 23, 2022Updated 3 years ago
- CCL2022 领域问答库构建测评☆20Oct 31, 2022Updated 3 years ago