实现了Baichuan-Chat微调,Lora、QLora等各种微调方式,一键运行。
☆70Aug 15, 2023Updated 2 years ago
Alternatives and similar repositories for Baichuan-Chat-Tuning
Users that are interested in Baichuan-Chat-Tuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆23Jul 17, 2023Updated 2 years ago
- Baichuan-13B 指令微调☆87Jul 14, 2023Updated 2 years ago
- 基于baichuan-7b的开源多模态大语言模型☆72Dec 7, 2023Updated 2 years ago
- The source code of 《 FGN:Fusion Glyph Network for Chinese Named Entity Recognition 》. SOTA Chinese NER method fusing both glyph represne…☆50Mar 22, 2020Updated 6 years ago
- ☆11Jan 3, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official Repository for "Modeling Hierarchical Structures with Continuous Recursive Neural Networks" (ICML 2021)☆12Aug 18, 2021Updated 4 years ago
- Baichuan2代码的逐行解析版本,适合小白☆211Sep 20, 2023Updated 2 years ago
- 官方transformers源码解析。AI大模型时代,pytorch、transformer是新操作系统,其他都是运行在其上面的软件。☆16Sep 25, 2023Updated 2 years ago
- 用Paddle复现论文ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information(ACL2021)☆10Nov 15, 2021Updated 4 years ago
- 基于qlora对baichuan-7B大模型进行指令微调。☆22Jun 22, 2023Updated 2 years ago
- Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…☆6,641Oct 24, 2024Updated last year
- the newest version of llama3,source code explained line by line using Chinese☆22Apr 19, 2024Updated 2 years ago
- 千问14B和7B的逐行解释☆64Sep 26, 2023Updated 2 years ago
- A dataset of news headlines for detecting causalities☆14May 9, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, …☆47Oct 8, 2025Updated 8 months ago
- Implementation of SATA Tree-LSTM (Dynamic Compositionality in Recursive Neural Networks with Structure-aware Tag Representations, AAAI 20…☆10Jun 21, 2022Updated 3 years ago
- 使用多头的思想来进行命名实体识别☆34May 5, 2021Updated 5 years ago
- [ICML 2022] HousE: Knowledge Graph Embedding with Householder Parameterization☆40Feb 1, 2022Updated 4 years ago
- A series of large language models developed by Baichuan Intelligent Technology☆4,098Nov 8, 2024Updated last year
- 零样本学习测评基准,中文版☆59Jun 23, 2021Updated 4 years ago
- Transformer related optimization, including BERT, GPT☆17Jul 29, 2023Updated 2 years ago
- ☆17Apr 8, 2025Updated last year
- This is the source code of our paper PALT in EMNLP2022.☆13Nov 19, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ChatGLM2-6B-Explained☆36Jul 28, 2023Updated 2 years ago
- CIKM 2021 Full Paper: FedMatch: Federated Learning Over Heterogeneous Question Answering Data☆12Dec 14, 2021Updated 4 years ago
- ☆12Aug 24, 2022Updated 3 years ago
- Tools for working with the S800 corpus☆12Sep 17, 2020Updated 5 years ago
- Imitate OpenAI with Local Models☆91Aug 27, 2024Updated last year
- Attention Is All You Need (https://arxiv.org/abs/1706.03762)☆10Apr 26, 2018Updated 8 years ago
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- ☆14Mar 10, 2020Updated 6 years ago
- Mind map for the course on Andrew Ng Machine Learning and popular platforms and libs for AI.☆12Dec 1, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Get a nicely-chunked local copy of the biomedical literature (to use for other projects)!☆15Jun 10, 2024Updated 2 years ago
- 命名实体识别☆12Dec 21, 2020Updated 5 years ago
- Official repository to release the code and datasets in the paper, "Article Reranking by Memory-enhanced Key Sentence Matching for Detect…☆19Dec 15, 2021Updated 4 years ago
- Implementing DBSCAN using numpy and pytorch☆11Aug 21, 2020Updated 5 years ago
- ☆11Mar 12, 2024Updated 2 years ago
- Wasserstein-Fisher-Rao Embedding: Logical Query Embeddings with Local Comparison and Global Transport (Findings-ACL 2023)☆13May 4, 2023Updated 3 years ago
- ☆19Aug 9, 2024Updated last year