Implements fine-tuning for Baichuan-Chat, supporting LoRA, QLoRA, and other fine-tuning methods, runnable with a single command.
☆70Aug 15, 2023Updated 2 years ago
Alternatives and similar repositories for Baichuan-Chat-Tuning
Users that are interested in Baichuan-Chat-Tuning are comparing it to the libraries listed below.
- ☆23Jul 17, 2023Updated 2 years ago
- Baichuan-13B instruction fine-tuning☆90Jul 14, 2023Updated 2 years ago
- The source code of "FGN: Fusion Glyph Network for Chinese Named Entity Recognition". SOTA Chinese NER method fusing both glyph represne…☆50Mar 22, 2020Updated 6 years ago
- ☆11Jan 3, 2024Updated 2 years ago
- Official Repository for "Modeling Hierarchical Structures with Continuous Recursive Neural Networks" (ICML 2021)☆12Aug 18, 2021Updated 4 years ago
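- Walkthrough of the official transformers source code. In the era of large AI models, PyTorch and transformers are the new operating system, and everything else is software running on top of them.☆16Sep 25, 2023Updated 2 years ago
- Paddle reproduction of the paper ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information (ACL 2021)☆10Nov 15, 2021Updated 4 years ago
- Instruction fine-tuning of the baichuan-7B large model using QLoRA.☆23Jun 22, 2023Updated 2 years ago
- Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, …☆6,649Oct 24, 2024Updated last year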
- ☆11Jun 4, 2021Updated 4 years ago
- The newest version of llama3, with source code explained line by line in Chinese☆22Apr 19, 2024Updated 2 years ago
- PyTorch version of the UniLM model☆26Jun 19, 2021Updated 4 years ago
- Create Persona dataset from reddit en movie category comment☆11Aug 6, 2021Updated 4 years ago
- A repo for updating and debugging Mixtral-7x8B, MOE, ChatGLM3, LLaMa2, BaChuan, Qwen and other LLM models, including new models mixtral, mixtral 8x7b, …☆47Oct 8, 2025Updated 6 months ago
- Implementation of SATA Tree-LSTM (Dynamic Compositionality in Recursive Neural Networks with Structure-aware Tag Representations, AAAI 20…☆10Jun 21, 2022Updated 3 years ago
- Named entity recognition using a multi-head approach☆34May 5, 2021Updated 4 years ago
- [ICML 2022] HousE: Knowledge Graph Embedding with Householder Parameterization☆40Feb 1, 2022Updated 4 years ago
- 🎬 Summary generation for Douban movie reviews☆10Aug 8, 2016Updated 9 years ago
- A series of large language models developed by Baichuan Intelligent Technology☆4,113Nov 8, 2024Updated last year
- Zero-shot learning evaluation benchmark, Chinese version☆59Jun 23, 2021Updated 4 years ago
- ☆16Apr 8, 2025Updated last year
- Transformer related optimization, including BERT, GPT☆17Jul 29, 2023Updated 2 years ago
- This is the source code of our paper PALT in EMNLP2022.☆13Nov 19, 2022Updated 3 years ago
- ChatGLM2-6B-Explained☆36Jul 28, 2023Updated 2 years ago
- A 13B large language model developed by Baichuan Intelligent Technology☆2,933Sep 6, 2023Updated 2 years ago
- ☆12Aug 24, 2022Updated 3 years ago
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- Classification of Chinese product names with a CNN, based on TensorFlow☆13Mar 22, 2019Updated 7 years ago
- Mind map for the course on Andrew Ng Machine Learning and popular platforms and libs for AI.☆12Dec 1, 2023Updated 2 years ago
- Get a nicely-chunked local copy of the biomedical literature (to use for other projects)!☆15Jun 10, 2024Updated last year
- ChatGPT Telegram bot☆52Mar 5, 2026Updated last month
- Named entity recognition☆12Dec 21, 2020Updated 5 years ago
- Chinese LLaMA-2 & Alpaca-2 LLMs (phase two of the project) with 64K long context models☆7,154Jul 15, 2025Updated 9 months ago
- Official repository to release the code and datasets in the paper, "Article Reranking by Memory-enhanced Key Sentence Matching for Detect…☆19Dec 15, 2021Updated 4 years ago
- ☆11Mar 12, 2024Updated 2 years ago
- Wasserstein-Fisher-Rao Embedding: Logical Query Embeddings with Local Comparison and Global Transport (Findings-ACL 2023)☆13May 4, 2023Updated 2 years ago
- ☆12Jun 14, 2022Updated 3 years ago
- A quick and dirty script to call LLaMA.cpp in Python. Supports streaming and interactive mode.☆13Apr 17, 2023Updated 3 years ago
- A large model for C++ interview questions, fine-tuned from baichuan-7B☆14Jul 8, 2023Updated 2 years ago