OpenCSGs / llm-finetune

A framework for training large language models. It supports LoRA, full-parameter fine-tuning, and more: define a YAML file to start training or fine-tuning with your chosen models, data, and methods. Easy to define and easy to start.

☆28

Alternatives and similar repositories for llm-finetune

Users who are interested in llm-finetune are comparing it to the libraries listed below.

OpenCSGs / llm-inference
llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deploy…
☆85 Updated last year
arcstep / illufly
✨🦋 illufly (幻蝶): a self-evolving agent based on memory distillation and document retrieval
☆69 Updated last month
ziwang-com / AGI-MAP
Architecture diagram of the AGI module library
☆76 Updated last year
OpenBMB / MobileCPM
A Toolkit for Running On-device Large Language Models (LLMs) in APP
☆77 Updated last year
thunlp / Delta-CoMe
Delta-CoMe can achieve near-lossless 1-bit compression; the work has been accepted by NeurIPS 2024
☆57 Updated 8 months ago
OpenCSGs / csghub-sdk
The CSGHub SDK is a powerful Python client specifically designed to interact seamlessly with the CSGHub server. This toolkit is engineere…
☆17 Updated this week
OpenCSGs / llm-scheduler-ui
LLM scheduler user interface
☆16 Updated last year
IEIT-Yuan / Yuan2.0-M32
Mixture-of-Experts (MoE) Language Model
☆189 Updated 10 months ago
xverse-ai / XVERSE-MoE-A4.2B
XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.
☆40 Updated last year
zzlgreat / smart_agent
☆105 Updated last year
UGAIForge / AgileGen
AgileGen: Empowering Agile-Based Generative Software Development through Human-AI Teamwork (accepted by ACM TOSEM)
☆24 Updated 8 months ago
Ai-trainee / o1-flow
Using Llama-3.1 70b on Groq to create o1-like reasoning chains
☆18 Updated 10 months ago
xuyuan23 / operateGPT
🌟 Revolutionize Your Operations with One Sentence Automation: Utilizing large language models and Multi-Agents to generate operational c…
☆54 Updated last year
aliyun / qwen-dianjin
Qwen DianJin: LLMs for the Financial Industry by Alibaba Cloud
☆119 Updated 2 months ago
tpoisonooo / ROGRAG
[ACL2025 demo track] ROGRAG: A Robustly Optimized GraphRAG Framework
☆167 Updated last month
Alibaba-NLP / MaskSearch
Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"
☆136 Updated 2 months ago
zai-org / GLM-Edge
GLM Series Edge Models
☆144 Updated last month
llm-factory / imitater
Imitate OpenAI with Local Models
☆87 Updated 11 months ago
ClosedCharacter / Peach
We are the first character large language model that is fully available for commercial use.
☆40 Updated 11 months ago
dataelement / bisheng-unstructured
bisheng-unstructured library
☆54 Updated 2 months ago
icip-cas / DeepSolution
DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking
☆48 Updated 4 months ago
xverse-ai / XVERSE-65B
XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.
☆139 Updated last year
ArtificialZeng / transformers-Explained
A walkthrough of the official transformers source code. In the era of large AI models, pytorch and transformer are the new operating system; everything else is software running on top of them.
☆17 Updated last year
NVIDIA / workbench-llamafactory
This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory.
☆62 Updated 9 months ago
ArtificialZeng / llama3_explained
The newest version of llama3, with source code explained line by line in Chinese
☆22 Updated last year
CLUEbenchmark / SuperCLUE-RAG
A native Chinese benchmark for evaluating retrieval-augmented generation
☆120 Updated last year
ictnlp / FlexRAG
FlexRAG: A RAG Framework for Information Retrieval and Generation.
☆203 Updated last month
01-ai / Descartes
☆111 Updated last year
codefuse-ai / FasterTransformer4CodeFuse
High-performance LLM inference based on our optimized version of FasterTransformer
☆123 Updated last year
xorbitsai / xllamacpp
xllamacpp - a Python wrapper of llama.cpp
☆48 Updated last week