yhlleo / LLMs_from_scratch
Learning records for building a large language model from scratch
☆57Updated 8 months ago
Alternatives and similar repositories for LLMs_from_scratch
Users who are interested in LLMs_from_scratch are comparing it to the repositories listed below
- ☆55Updated 10 months ago
- Cookbook for Crafting Good Code☆56Updated last year
- Stream live plots to a matplotlib figure☆79Updated 5 months ago
- A transformer-based multimodal model for music.☆29Updated last year
- Support BM25+vector☆29Updated 3 months ago
- Customize your arXiv recommendation every day.☆123Updated 5 months ago
- Chrome / Edge extension to turn arXiv papers into Markdown in one click.☆82Updated 5 months ago
- An interactive web-based demonstration of fundamental tabular Reinforcement Learning (RL) algorithms in a simple grid world environment.☆76Updated 3 months ago
- 1000 entrepreneurial ideas from ycombinator, one idea per line.☆55Updated 9 months ago
- Fetch arxiv data to LLM-friendly text☆125Updated 6 months ago
- Static suckless single batch CUDA-only qwen3-0.6B mini inference engine☆433Updated last week
- An AI-driven daily arXiv paper crawler, analyzer, and organizer tool, focusing on AIGC☆62Updated this week
- An AI agent to control drones from your CLI☆130Updated last month
- AI development tutorial based on Roo Cline + DeepSeek☆69Updated 6 months ago
- ☆168Updated 10 months ago
- Large Language Model in Action☆335Updated 7 months ago
- [EMNLP 2025 Demo] PresentAgent: Multimodal Agent for Presentation Video Generation☆99Updated last week
- Countdown Game Distill&RL☆47Updated 2 weeks ago
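- This series aims to let readers understand and implement, from scratch, every component of a large language model, along with the training and fine-tuning code, using only basic PyTorch and no other off-the-shelf external libraries. Readers need only Python, PyTorch, and the most basic deep learning background.☆365Updated 3 weeks ago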
- How to get the best results: Improve-Your-Prompt is a prompt for optimizing prompts☆39Updated 9 months ago
- AgentForge is a powerful and flexible signal-driven workflow framework designed for building intelligent, dynamic, and adaptive systems.☆19Updated 5 months ago
- Paper reading tool: one-click screenshot + AI translation, math formula support, and pinned multi-window management☆120Updated 3 weeks ago
- DeepSearch Code-Actions Agent (DSCA). Build 🙌 with 🤗 smolagents☆117Updated last month
- ☆175Updated last year
- Train a Language Model with GRPO to create a schedule from a list of events and priorities☆231Updated 4 months ago
- LLM-Powered Semi-Structured Table Question Answering☆224Updated this week
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆45Updated 8 months ago
- Convert Everything to PDF☆164Updated 4 months ago
- 🧐 Open Deep Research Agent: Automated Knowledge Discovery with TextGAN☆94Updated last month
- ☆54Updated 6 months ago