☆394Apr 29, 2025Updated 10 months ago
Alternatives and similar repositories for intro-llm-code
Users that are interested in intro-llm-code are comparing it to the libraries listed below
Sorting:
- website☆463Mar 5, 2025Updated last year
- 最基本最小白的自然语言处理入门读物,基于deepseek-r1,涵盖了传统NLP和现代大模型☆23Jan 16, 2026Updated last month
- The implement of LLMTreeRec☆14Dec 9, 2024Updated last year
- 回声Echo:AI文案助手☆10May 6, 2023Updated 2 years ago
- 本项目从零开始构建并优化了一个千万参数级别的大规模预训练语言模型,涵盖预训练、有监督微调(SFT)和R1推理蒸馏三个阶段。项目采用自定义Transformer架构(包括RMSNorm、分组注意力、多Query机制、SwiGLU激活和RoPE位置编码),实现高效的长文本处理和…☆21Mar 10, 2025Updated last year
- PyTorch Implementation of ViT-TTS (EMNLP'23)☆11Oct 20, 2023Updated 2 years ago
- ☆15Jun 22, 2025Updated 8 months ago
- Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination.☆21Jul 18, 2025Updated 7 months ago
- an implementation of parallel skills like amp, ddp, pp, tp for learning purposes☆14Nov 18, 2023Updated 2 years ago
- 可以成功Lora微调的Qwen-VL模型☆16Oct 27, 2023Updated 2 years ago
- Bootstrapping loss function implementation in pytorch☆36Dec 3, 2020Updated 5 years ago
- ☆75Mar 7, 2024Updated 2 years ago
- 《动手学大模型Dive into LLMs》系列编程实践教程☆21,930Oct 10, 2025Updated 5 months ago
- 保存有关DDPM直播的资料☆20Apr 7, 2024Updated last year
- 本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)☆23,400Mar 1, 2026Updated last week
- Learnable Item Tokenization for Generative Recommendation (Most Cited Paper at CIKM'24)☆137Jan 1, 2025Updated last year
- ☆30Jul 19, 2024Updated last year
- 《大模型白盒子构建指南》:一个全手搓的Tiny-Universe☆4,563Feb 12, 2026Updated last month
- TinyRAG☆435Jun 28, 2025Updated 8 months ago
- 每个人都能看懂的大模型知识分享,LLMs春/秋招大模型面试前必看,让你和面试官侃侃而谈☆5,706Mar 2, 2026Updated last week
- 中文翻译的 Hands-On-Large-Language-Models (hands-on-llms),动手学习大模型☆2,241Oct 19, 2025Updated 4 months ago
- Teacher Ma's resource sharing group☆26Nov 18, 2025Updated 3 months ago
- MetaInv-Net: Meta Inversion Network for Sparse View CT Image Reconstruction☆26Jan 10, 2022Updated 4 years ago
- 通过动画学强化学习笔记☆65Feb 17, 2025Updated last year
- Keras implementation of Training Deep Neural Networks on Noisy Labels with Bootstrapping, Reed et al. 2015☆22Jan 28, 2021Updated 5 years ago
- 🏆🏆 「大模型」All in one & All from scratch. 🌍🌍 收集、清洗数据,训练Tokenizer,预训练、SFT、GRPO!☆53Aug 12, 2025Updated 7 months ago
- [ICLR 2025] Official implementation of DICL (Disentangled In-Context Learning), featured in the paper "Zero-shot Model-based Reinforcemen…☆26Feb 14, 2025Updated last year
- Code for EMNLP 2020 paper `Connecting the Dots: Event Graph Schema Induction with Path Language Modeling`☆23Nov 16, 2020Updated 5 years ago
- LLMs-from-scratch项目中文翻译☆2,379Oct 15, 2025Updated 4 months ago
- 《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣☆4,350Sep 2, 2025Updated 6 months ago
- ☆24Sep 24, 2024Updated last year
- 筱可的工程实验仓库!☆109Oct 31, 2025Updated 4 months ago
- 《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程☆28,676Feb 24, 2026Updated 2 weeks ago
- A Minimalistic Auto-Diff Optimization Framework for Teaching and Understanding Pytorch☆25Updated this week
- 使用Spark GraphX基于PageRank算法构建一个仿微博用户好友的分布式推荐系统。☆24Aug 26, 2018Updated 7 years ago
- 主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题☆12,768Apr 30, 2025Updated 10 months ago
- Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library☆50Aug 20, 2025Updated 6 months ago
- 仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理☆4,008Aug 15, 2024Updated last year
- 图深度学习(葡萄书),在线阅读地址: https://datawhalechina.github.io/grape-book☆276Apr 21, 2024Updated last year