wp931120 / baichuan_sft_loraView external linksLinks
baichuan LLM surpervised finetune by lora
☆64Jun 28, 2023Updated 2 years ago
Alternatives and similar repositories for baichuan_sft_lora
Users that are interested in baichuan_sft_lora are comparing it to the libraries listed below
Sorting:
- Baichuan-13B 指令微调☆90Jul 14, 2023Updated 2 years ago
- ☆15Mar 12, 2024Updated last year
- 文本数据增强☆15Apr 10, 2020Updated 5 years ago
- ☆27Nov 25, 2025Updated 2 months ago
- chatglm3-6b, 微调/LORA/推理/单机多卡/deepspeed/支持多轮对话☆17Nov 30, 2023Updated 2 years ago
- Baichuan2代码的逐行解析版本,适合小白☆212Sep 20, 2023Updated 2 years ago
- codes and pre-trained models of paper "Segatron: Segment-aware Transformer for Language Modeling and Understanding"☆18Oct 25, 2022Updated 3 years ago
- Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.☆619Jan 24, 2025Updated last year
- Deepseek-r1复现科普与资源汇总☆22Mar 5, 2025Updated 11 months ago
- [ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"☆25Mar 28, 2024Updated last year
- 🛍 A full E-commerce app with nice UI consists of on-boarding, login, sign-up, home, product details, cart and user profile.☆10Sep 8, 2024Updated last year
- 基于ChatGLM2-6B进行微调,包括全参数、参数有效性、量化感知训练等,可实现指令微调、多轮对话微调等。☆26Jul 29, 2023Updated 2 years ago
- 🚀 Automatically convert unstructured data into a high-quality 'textbook' format, optimized for fine-tuning Large Language Models (LLMs)☆25Oct 15, 2023Updated 2 years ago
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆110Jul 19, 2023Updated 2 years ago
- A large-scale 7B pretraining language model developed by BaiChuan-Inc.☆5,686Jul 18, 2024Updated last year
- Deepspeed、LLM、Medical_Dialogue、医疗大模型、预训练、微调☆290Jun 7, 2024Updated last year
- [ICLR 2026] A novel cross-modal decoupling and alignment framework for multimodal representation learning.☆44Feb 5, 2026Updated last week
- ☆34Sep 14, 2024Updated last year
- MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]☆22Dec 10, 2025Updated 2 months ago
- A series of large language models developed by Baichuan Intelligent Technology☆4,118Nov 8, 2024Updated last year
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆12Apr 15, 2025Updated 10 months ago
- Spam SMS Detector, Flutter Application☆11Oct 14, 2021Updated 4 years ago
- 基于WebSocket协议实现实时弹幕信息爬取与信息通信。通过MaxKB容器训练直播互动模型,具备智能互动能力,通过微调预训练的语言模型来适应特定的直播场景需求,提升数字人的交互体验。基于TTS和Wav2lip开发语音克隆和唇形同步算法,通过预训练数字人模型的方式压缩生成时…☆13Oct 7, 2024Updated last year
- A django-yolov5 starter webapp. Based on yolov5-flask example.☆11Mar 6, 2022Updated 3 years ago
- A full-featured AI chatbot built with Next.js 15 and iFlow CLI SDK, providing Claude Code-like interactive experience with file operation…☆23Dec 7, 2025Updated 2 months ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 4 months ago
- ☆20Sep 11, 2025Updated 5 months ago
- LLM-powered chatbot app to enhance accessibility to knowledge contained within PDFs☆14May 6, 2025Updated 9 months ago
- ☆11Aug 20, 2025Updated 5 months ago
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆24Jul 21, 2025Updated 6 months ago
- finetune llama2 with traditional chinese dataset☆39Aug 8, 2023Updated 2 years ago
- Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.☆1,018Apr 27, 2024Updated last year
- 通用机器人控制器上位机☆11Feb 10, 2021Updated 5 years ago
- Towards an implementation of hierarchical temporal memory and the cortical learning algorithm by Jeff Hawkins and Dileep George of Nument…☆12Mar 15, 2017Updated 8 years ago
- 带动画插值的 UI 抽象工具集 模拟器☆10Jun 21, 2024Updated last year
- ESP32 Balancing Cube☆17Aug 12, 2025Updated 6 months ago
- ☆10Apr 17, 2024Updated last year
- A very light C/C++ implementation of Obyte (formerly Byteball) for Arduino☆13Jul 21, 2020Updated 5 years ago
- T22_034_han_shi_hao_CRDDC_2022_SourceCode☆11Dec 29, 2023Updated 2 years ago