LLaMA-Factory使用经验记录
☆44Aug 26, 2024Updated last year
Alternatives and similar repositories for My-LLaMA-Factory
Users that are interested in My-LLaMA-Factory are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于ChatGLM3基座模型和LLAMA-Factory框架进行微调的一个中医问答机器人☆114Jan 3, 2024Updated 2 years ago
- langchain-study☆31Jun 18, 2026Updated last week
- ☆14Apr 10, 2025Updated last year
- 2024CCF国际AIOps挑战赛-赛道二(GLM4):基于检索增强的运维知识问答挑战赛解决方案分享。☆14Jul 5, 2024Updated last year
- LLM inference in C/C++☆12Jun 5, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 基于qwen3的医疗大模型研发全流程 0.分词训练 1.增量预训练 2.微调 3.强化 4.量化 5.蒸馏 6.评估 7.lora模型合并 8.服务 9.部署☆46Jan 3, 2026Updated 5 months ago
- ☆15Feb 25, 2025Updated last year
- 一个微博毒舌AI,疯狂 diss 微博博主☆15Jan 2, 2025Updated last year
- We implement an efficient mechanism for compressing large networks by {\em tensorizing\/} network layers: i.e. mapping layers on to high-…☆11Jul 10, 2018Updated 7 years ago
- Native AI 是一个探索本地生活电商领域的多智能体系统,通过 AI 助手一站式解决用户吃喝玩乐住行等日常生活需求。系统基于大语言模型技术,主要为了探索Multi Agent的应用。☆11Apr 13, 2025Updated last year
- Simple code for the tutorial on Polynomial Nets.☆13Jan 19, 2023Updated 3 years ago
- GraphRAG 中文文档。GraphRAG是一种结构化的、分层的检索增强生成(RAG)方法,而不是使用纯文本片段的语义搜索方法。GraphRAG 过程包括从原始文本中提取出知识图谱,构建社群层级(这种结构通常用来描述个体、群体及它们之间的关系,帮助理解信息如何在社群内部传…☆19Jul 12, 2024Updated last year
- A simple module/way to use Perplexity AI in Python.☆13May 9, 2024Updated 2 years ago
- An AI project to provide `private` chat and RAG service. 一个提供私有化检索增强生成的AI项目☆11Jul 14, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Codebase for Instruction Following without Instruction Tuning☆36Sep 24, 2024Updated last year
- LGEB: Benchmark of Language Generation Evaluation☆16Oct 21, 2022Updated 3 years ago
- Python中文分词,根据词频生成词云图片☆25Nov 18, 2020Updated 5 years ago
- Qwen1.5大模型微调、基于PEFT框架LoRA微调,在数据集HC3-Chinese上实现文本分类。☆12Jun 29, 2024Updated 2 years ago
- TBD☆12Feb 27, 2026Updated 4 months ago
- ubuntu 系统下 GLM-4-Voice 部署经验分享☆18Oct 31, 2024Updated last year
- [AAAI 2025 Oral] Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks☆31Apr 14, 2025Updated last year
- 或许这里有作为同济大学软件学院机器智能的一位学生学业所需的所有东西☆20Aug 5, 2024Updated last year
- 利用大模型LLM对中文文本、图片以及pdf中的非结构化文本内容进行分析,并提取主-谓-宾(SPO)三元组的知识形式,以及将这些关系可视化为知识图谱。The large LLM model is used to analyze the unstructured text co…☆30Apr 16, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- rt_memcpy Cortex-M 汇编加速版☆12Jun 9, 2022Updated 4 years ago
- Python library for the Gemini Exchange API☆17Oct 31, 2019Updated 6 years ago
- Docker&vLLM官方镜像部署DeepSeek模型, 在生产环境中提供类OpenAI接口服务。☆14Jul 17, 2025Updated 11 months ago
- Apply CP, Tucker, TT/TR, HT to compress neural networks. Train from scratch.☆17Nov 26, 2020Updated 5 years ago
- 这个仓库用于存储一些强化学习练手小项目与算法实验。具体来讲,就是不至于单独成一个 repo 的项目,但是又值得拿出来讨论的代码。☆28May 27, 2021Updated 5 years ago
- ☆17Apr 14, 2024Updated 2 years ago
- [TCSS 2024] MAE pre-training models (ViT and ConvNeXt) using AffectNet images for static facial expression recognition (SFER).☆42Jun 3, 2025Updated last year
- LLM powered AI multi agent platform that coordinate global to individual health through scaling each layer of healthcare☆28May 8, 2024Updated 2 years ago
- A collection of practical code generation tasks and tests from open source projects. Complementary to HumanEval by OpenAI.☆24Jan 28, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Website for Learning from "Big Code"☆30Jun 19, 2021Updated 5 years ago
- ModelScope+Transformers+SwanLab实现Qwen-1.5-7b的指令微调任务☆23Jun 9, 2024Updated 2 years ago
- ☆12Oct 7, 2023Updated 2 years ago
- 基于qwenvl微调一个多模态Xray识别的大模型☆22Oct 22, 2024Updated last year
- c implementation of Maximally Stable Extremal Regions (MSER) algorithm (modify from VLFeat)☆15Aug 4, 2019Updated 6 years ago
- 这是同济大学软件学院2024年网络方向数据分析与数据挖掘专选作业和笔记🌸~☆21Jun 20, 2024Updated 2 years ago
- Leader-based Multi-Scale Attention Deep Architecture for Person Re-identification☆12Jan 21, 2021Updated 5 years ago