mark1879 / Baichuan-13B-FinetuningView external linksLinks
Baichuan-13B 指令微调
☆90Jul 14, 2023Updated 2 years ago
Alternatives and similar repositories for Baichuan-13B-Finetuning
Users that are interested in Baichuan-13B-Finetuning are comparing it to the libraries listed below
Sorting:
- Research project on glyph-based Chinese character embedding. Preparing for EMNLP 2019☆10Mar 18, 2019Updated 6 years ago
- A 13B large language model developed by Baichuan Intelligent Technology☆2,951Sep 6, 2023Updated 2 years ago
- InternLM-7B微调, SFT/LoRA, instruction finetune☆13May 17, 2024Updated last year
- ☆12Apr 29, 2024Updated last year
- ☆16Aug 5, 2018Updated 7 years ago
- 使用peft库,对chatGLM-6B/chatGLM2-6B实现4bit的QLoRA高效微调,并做lora model和base model的merge及4bit的量化(quantize)。☆358Aug 22, 2023Updated 2 years ago
- Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…☆6,635Oct 24, 2024Updated last year
- ChatGLM2-6B 全参数微调,支持多轮对话的高效微调。☆402Aug 17, 2023Updated 2 years ago
- Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.☆619Jan 24, 2025Updated last year
- Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调☆3,730Oct 12, 2023Updated 2 years ago
- 纯c++的全平台llm加速库,支持python调用,支持baichuan, glm, llama, moss基座,手机端流畅运行chatglm-6B级模型单卡可达10000+token / s,☆43Aug 16, 2023Updated 2 years ago
- This is an official PyTorch implementation of "Gesture2Vec: Clustering Gestures using Representation Learning Methods for Co-speech Gestu…☆26Feb 9, 2024Updated 2 years ago
- Oak National Academy's AI Auto Eval tools provide LLM as a judge evaluation on lesson plans and resources☆17Nov 4, 2025Updated 3 months ago
- ☆18Jun 10, 2025Updated 8 months ago
- 使用多头的思想来进行命名实体识别☆34May 5, 2021Updated 4 years ago
- A large-scale 7B pretraining language model developed by BaiChuan-Inc.☆5,686Jul 18, 2024Updated last year
- A series of large language models developed by Baichuan Intelligent Technology☆4,118Nov 8, 2024Updated last year
- Generalized Sentiment Classifier finetuned by KoELECTRA☆11Nov 28, 2024Updated last year
- This project showcases engaging interactions between two AI chatbots.☆10Jan 10, 2024Updated 2 years ago
- chatglm多gpu用deepspeed和☆409Jul 8, 2024Updated last year
- Resources for my <model-viewer> course☆11Jul 25, 2023Updated 2 years ago
- XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.☆644Apr 9, 2024Updated last year
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 5 months ago
- High-level Rust library that binds to Poppler to extract text from a PDF☆11Dec 16, 2020Updated 5 years ago
- 天池比赛☆10Jul 4, 2021Updated 4 years ago
- The official implementation of our work SQLFixAgent: Towards Semantic-Accurate Text-to-SQL Parsing via Consistency-Enhanced Multi-Agent C…☆23May 2, 2025Updated 9 months ago
- ☆10Sep 2, 2024Updated last year
- reproduce SimCSE in jupyter-notebook☆10Nov 28, 2021Updated 4 years ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Sep 21, 2025Updated 4 months ago
- A simple algorithm to find ordered key-value pairs from paddleOCR recognition outputs☆10Mar 1, 2021Updated 4 years ago
- ☆10Jul 20, 2020Updated 5 years ago
- A Toolkit for Fine-Tuning Large Language Models with LoRA and DeepSpeed☆11Apr 14, 2023Updated 2 years ago
- ☆10Apr 7, 2023Updated 2 years ago
- Ice segment plugin for Bluge☆12Jul 4, 2022Updated 3 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- python人脸识别和情绪识别☆15Oct 2, 2023Updated 2 years ago
- vertex and uv texture map☆12Mar 13, 2023Updated 2 years ago
- (ICCV'25) TF-TI2I: Training-Free Text-and-Image-to-Image Generation via Multi-Modal Implicit-Context Learning in Text-to-Image Models (Au…☆14Aug 22, 2025Updated 5 months ago
- ☆11Jun 4, 2021Updated 4 years ago