OpenCSGs / llm-finetune
The framework of training large language models,support lora, full parameters fine tune etc, define yaml to start training/fine tune of your defined models, data and methods. Easy define and easy start.
☆25Updated 5 months ago
Alternatives and similar repositories for llm-finetune:
Users that are interested in llm-finetune are comparing it to the libraries listed below
- llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deploy…☆79Updated 9 months ago
- The CSGHub SDK is a powerful Python client specifically designed to interact seamlessly with the CSGHub server. This toolkit is engineere…☆14Updated 2 months ago
- LLM scheduler user interface☆14Updated 9 months ago
- GLM Series Edge Models☆130Updated 3 weeks ago
- ☆107Updated 11 months ago
- Its an open source LLM based on MOE Structure.☆58Updated 8 months ago
- Using Llama-3.1 70b on Groq to create o1-like reasoning chains☆19Updated 5 months ago
- This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory.☆50Updated 4 months ago
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆65Updated 8 months ago
- ☆44Updated last year
- The RedStone repository includes code for preparing extensive datasets used in training large language models.☆113Updated last month
- Imitate OpenAI with Local Models☆87Updated 6 months ago
- Qwen-Efficient-Tuning☆43Updated last year
- LM inference server implementation based on *.cpp.☆131Updated this week
- AGI模块库架构图☆75Updated last year
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆54Updated 9 months ago
- SUS-Chat: Instruction tuning done right☆48Updated last year
- ☆25Updated 4 months ago
- Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024☆54Updated 3 months ago
- 🌟 Revolutionize Your Operations with One Sentence Automation: Utilizing large language models and Multi-Agents to generate operational c…☆53Updated last year
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆37Updated 2 months ago
- ☆92Updated 3 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated 10 months ago
- Mixture-of-Experts (MoE) Language Model☆185Updated 6 months ago
- code for piccolo embedding model from SenseTime☆122Updated 9 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆132Updated 3 months ago
- bisheng-unstructured library☆42Updated 3 months ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆70Updated last month
- Dingo: A Comprehensive Data Quality Evaluation Tool☆70Updated last week