yhlleo / LLMs_from_scratchLinks
Learning records for building a large language model from scratch
☆58Updated last year
Alternatives and similar repositories for LLMs_from_scratch
Users that are interested in LLMs_from_scratch are comparing it to the libraries listed below
Sorting:
- ☆55Updated last year
- Stream live plots to a matplotlib figure☆80Updated 8 months ago
- support BM25+vecetor☆29Updated 7 months ago
- Cookbook for Crafting Good Code☆57Updated last year
- A transformer-based multimodal model for music.☆29Updated last year
- Customize your arXiv recommendation every day.☆137Updated 3 months ago
- Fetch arxiv data to LLM-friendly text☆128Updated 3 weeks ago
- An AI-driven daily arXiv paper crawler, analyzer, and organizer tool, focusing on AIGC☆71Updated this week
- Mission intent compiler and autonomy supervisor for unmanned systems.☆144Updated 2 weeks ago
- Chrome / Edge extension to turn arXiv papers into Markdown codes in one click.☆88Updated 9 months ago
- Large Language Model in Action☆341Updated 11 months ago
- ☆170Updated last year
- Wanna breeze through some papers?☆65Updated last month
- An interactive web-based demonstration of fundamental tabular Reinforcement Learning (RL) algorithms in a simple grid world environment.☆90Updated 6 months ago
- 论文阅读工具,一键截图+AI翻译,支持数学公式,贴片多窗口管理☆131Updated 4 months ago
- ☆80Updated 8 months ago
- 如何得到最好的结果,Improve-Your-Prompt是一个用于优化prompt的prompt☆37Updated last year
- Countdown Game Distill&RL☆47Updated 3 months ago
- Enable tool-use ability for any LLM model (DeepSeek V3/R1, etc.)☆58Updated 7 months ago
- Static suckless single batch CUDA-only qwen3-0.6B mini inference engine☆535Updated 3 months ago
- ☆176Updated last year
- 基于Roo Cline+DeepSeek的AI开发教程☆75Updated 9 months ago
- 让 AI 设计 AI,让大模型帮助小模型进化,用魔法创造魔法! Empower Artificial Intelligence to sculpt its own kind, where colossal models gracefully usher the petit…☆98Updated 2 years ago
- 《The Maeiee's Book》(简称 TMB)☆182Updated 2 months ago
- [EMNLP 2025 Demo] PresentAgent: Multimodal Agent for Presentation Video Generation☆120Updated last month
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆45Updated 11 months ago
- 🌟 Dive into the world of machine learning with three no-framework, beginner-friendly models. | 基于项目的机器学习入门理论详解。☆34Updated last year
- ☆50Updated 3 months ago
- 该系列的目的是让读者可以在基础的pytorch上,不依赖任何其他现成的外部库,从零开始理解并实现一个大语言模型的所有组成部分,以及训练微调代码,因此读者仅需python,pytorch和最基础深度学习背景知识即可。☆378Updated 4 months ago
- 北理 ”编译原理与设计“ 课设,一款使用 Java 开发的简易 C 语言编译器(x86 架构),支持绝大部分 C 语言语法。☆111Updated 9 months ago