NonlinearWorld001 / LLMs-learningLinks
小模型LLM的搭建,学习LLM的建模、训练过程 基于DeepSeek-MOE架构的小模型,用于个人学习,从0开始,解释每一条语句
☆11Updated 8 months ago
Alternatives and similar repositories for LLMs-learning
Users that are interested in LLMs-learning are comparing it to the libraries listed below
Sorting:
- tensorrt部署教程☆11Updated 3 months ago
- deploy onnx models with TensorRT and LibTorch☆19Updated 4 years ago
- 为centos服务器配置clash服务☆13Updated 11 months ago
- ☆54Updated 8 months ago
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆19Updated last year
- LLM手撕代码合集☆17Updated 8 months ago
- 一个基于多模态大模型的图表解析器☆41Updated 8 months ago
- springboot demo combined with scala and java☆11Updated 7 years ago
- ☆29Updated last year
- 中文金融大模型测评基准,六大类二十五任务、等级化评价,国内模型获得A级☆10Updated last year
- Happy experimenting with MLLM and LLM models!☆126Updated last year
- Converted the training data of OpenVLA into general form of multimodal training instructions and then used with LLaVA-OneVision☆23Updated 10 months ago
- RAG 系列教程源码仓库☆94Updated 6 months ago
- 使用煤矿历史事故案例,事故处理报告、安全规程规章制度、技术文档、煤矿从业人员入职考试题库等数据,微调internlm2模型实现针对煤矿事故和煤矿安全知识的智能问答。☆54Updated 10 months ago
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆27Updated last year
- 基于MindSpore的TinyRAG实现☆19Updated 11 months ago
- unify-easy-llm(ULM)旨在打造一个简易的一键式大模型训练工具,支持Nvidia GPU、Ascend NPU等不同硬件以及常用的大模型。☆58Updated last year
- 筱可的工程实验仓库!☆99Updated last month
- 让算法工程化更简单☆95Updated 8 months ago
- 《自然语言处理:大模型理论与实践》配套数据和代码☆73Updated last week
- Programming with local large language model.☆24Updated last month
- 从MinerU中提取出来的文本检测识别部分,通过pytorch实现paddleocr的文本检测识别☆19Updated 5 months ago
- 利用大模型LLM对中文文本、图片以及pdf中的非结构化文本内容进行分析,并提取主-谓-宾(SPO)三元组的知识形式,以及将这些关系可视化为知识图谱。The large LLM model is used to analyze the unstructured text co…☆18Updated 7 months ago
- Java library to fulfil the requirement of numpy in java☆22Updated last year
- The objective of this project is to demonstrate how to fine-tune deepseek-r1-distill-llama-8b.☆16Updated 9 months ago
- 在RAG技术中,嵌入向量的生成和匹配是关键环节。本文介绍了一种基于CLIP/BLIP模型的嵌入服务,该服务支持文本和图像的嵌入生成与相似度计算,为多模态信息检索提供了基础能力。☆38Updated 11 months ago
- 最少使用 3090 即可训练自己的比特大脑(miniLLM)🧠(进行中). Train your own BitBrain(A mini LLM) with just an RTX 3090 minimum.☆38Updated 5 months ago
- The deployment of deep learning model inference on the Java platform includes some common CV and NLP tasks.☆15Updated last year
- 基于qwenvl微调一个多模态Xray识别的大模型☆21Updated last year
- transformer深入学习,使用Excel实现☆19Updated 6 years ago