yangjianxin1 / unsloth
Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory
☆28Updated last year
Alternatives and similar repositories for unsloth
Users who are interested in unsloth are comparing it to the libraries listed below.
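- The first Chinese version of the Llama 2 13B model (Base + Chinese dialogue SFT, enabling fluent multi-turn natural-language interaction between human and machine)☆90Updated last year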
- zero: training an LLM from scratch and tuning its parameters☆31Updated last year
- The world's first Chinese-optimized version of StableVicuna.☆64Updated 2 years ago
- To try TextIn document parsing, visit https://cc.co/16YSIy☆22Updated last year
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆39Updated last year
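- Train a mini Chinese large language model from scratch that can hold basic conversations; the model size depends on the hardware you have available☆60Updated 11 months ago
- Line-by-line explanation of Qwen (千问) 14B and 7B☆60Updated last year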
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆38Updated 6 months ago
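- This project aims to give beginners in the large-model field a comprehensive body of knowledge, covering both fundamental and advanced topics, so that developers can quickly master the large-model technology stack and understand the related concepts.☆61Updated 6 months ago
- We are the first fully commercially usable role-play large model.☆40Updated 11 months ago
- unify-easy-llm (ULM) aims to be a simple one-click large-model training tool, supporting different hardware such as Nvidia GPU and Ascend NPU as well as commonly used large models.☆55Updated 11 months ago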
- Share data, prompt data, pretraining data☆36Updated last year
- A Chinese-native retrieval-augmented generation evaluation benchmark☆119Updated last year
- Repo for the paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆67Updated 11 months ago
- An open-source LLM based on an MoE structure.☆58Updated last year
- ☆105Updated last year
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆139Updated last year
- GLM Series Edge Models☆144Updated last month
- ☆44Updated 4 months ago
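- As the name suggests: a hand-built RAG☆125Updated last year
- SuperCLUE Langya Leaderboard (琅琊榜): an anonymous head-to-head evaluation benchmark for Chinese general-purpose large models☆145Updated last year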
- Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents!☆179Updated this week
- Video understanding: Qwen (千问) multimodal video model & Dify☆61Updated 10 months ago
- ☆27Updated 8 months ago
- Silk Road will be the dataset zoo for Luotuo (骆驼). Luotuo is an open-source Chinese-LLM project founded by 陈启源 @ Central China Normal University & 李鲁鲁 @ SenseTime & 冷子…☆39Updated last year
- Baidu QA dataset with 1 million entries☆47Updated last year
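- Python3 package for Chinese/English OCR, with paddleocr-v4 onnx model (~14MB). Inference is based on the ppocr-v4-onnx model and delivers accurate, millisecond-level OCR prediction on CPU; Chinese/English OCR in general scenarios reaches open-source SO…☆90Updated 5 months ago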
- Accelerate vector generation by using an ONNX model☆17Updated last year
- GPT+ power tool: a simple, practical one-stop AGI architecture with built-in localization, LLM models, agents, a vector database, and smart chains☆48Updated last year
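- The simplest reproduction of R1 results on small models, explaining the most important essence of O1-like models and DeepSeek R1. Think is all your need. Backed by experiments: for strong reasoning ability, the process content of thinking is the core of AGI/ASI.☆45Updated 5 months ago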