yangjianxin1 / unslothLinks

Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory

☆29

Alternatives and similar repositories for unsloth

Users that are interested in unsloth are comparing it to the libraries listed below

Sorting:

CrazyBoyM / llama2-Chinese-chat
首个llama2 13b 中文版模型（Base + 中文对话SFT，实现流畅多轮人机自然语言交互)
☆91Updated 2 years ago
ArtificialZeng / Qwen-Explained
千问14B和7B的逐行解释
☆63Updated 2 years ago
ClosedCharacter / Peach
我们是第一个完全可商用的角色大模型。
☆40Updated last year
StarRing2022 / R1-Nature
最简易的R1结果在小模型上的复现，阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证，对于强推理能力，think思考过程性内容是AGI/ASI的核心。
☆44Updated 9 months ago
AI-Study-Han / Mini-Llama2-Chinese
想要从零开始训练一个中文的mini大语言模型，可以进行基本的对话，模型大小根据手头的机器决定
☆64Updated last year
Gzy1112 / MMRAG-DocQA
☆31Updated 2 months ago
xverse-ai / XVERSE-MoE-A4.2B
XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.
☆39Updated last year
shootime2021 / APUS-xDAN-4.0-moe
Its an open source LLM based on MOE Structure.
☆58Updated last year
lutongyv / Textin_Tester
如需体验textin文档解析，请点击https://cc.co/16YSIy
☆22Updated last year
MetaGLM / OpenLM
本项目致力于为大模型领域的初学者提供全面的知识体系，包括基础和高阶内容，以便开发者能迅速掌握大模型技术栈并全面了解相关知识。
☆62Updated 10 months ago
xverse-ai / XVERSE-7B
XVERSE-7B: A multilingual large language model developed by XVERSE Technology Inc.
☆53Updated last year
t6am3 / law_glm_baseline
☆15Updated last year
soulteary / dify-with-qwen-vl
视频理解：千问视频多模态模型 & Dify
☆65Updated last year
zai-org / GLM-Edge
GLM Series Edge Models
☆153Updated 5 months ago
xverse-ai / XVERSE-65B
XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.
☆141Updated last year
CLUEbenchmark / SuperCLUElyb
SuperCLUE琅琊榜：中文通用大模型匿名对战评价基准
☆145Updated last year
shibing624 / github-hot
Tracking the hot Github repos and update daily 每天自动追踪Github热门项目
☆49Updated last week
SUSTech-IDEA / SUS-Chat
SUS-Chat: Instruction tuning done right
☆49Updated last year
MonolithFoundation / Bumblebee
A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.
☆38Updated last year
ssbuild / aigc_data
share data， prompt data , pretraining data
☆36Updated 2 years ago
kaixindelele / ChatSensitiveWords
利用LLM+敏感词库，来自动判别是否涉及敏感词。
☆134Updated 2 years ago
xverse-ai / XVERSE-V-13B
☆79Updated last year
RapidAI / Open-Llama
The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
☆68Updated 2 years ago
ssbuild / qwen_finetuning
qwen models finetuning
☆104Updated 8 months ago
zzlgreat / smart_agent
☆106Updated 2 years ago
infinigence / InfiniWebSearch
A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.
☆38Updated 11 months ago
CLUEbenchmark / SuperCLUE-RAG
中文原生检索增强生成测评基准
☆123Updated last year
CrazyBoyM / LLM-Chinese
（撰写ing..)本仓库偏教程性质，以「模型中文化」为一个典型的模型训练问题切入场景，指导读者上手学习LLM二次微调训练。
☆36Updated last year
SmartFlowAI / Hand-on-RAG
顾名思义：手搓的RAG
☆130Updated last year
glide-the / InterpretationoDreams
基于langchain设计的智能体任务，包含规划会话场景资源，构建子任务，任务执行器包含（MCTS）
☆30Updated 3 weeks ago