yhlleo / LLMs_from_scratch
Learning records for building a large language model from scratch
☆57Updated 7 months ago
Alternatives and similar repositories for LLMs_from_scratch
Users that are interested in LLMs_from_scratch are comparing it to the libraries listed below
- ☆55Updated 9 months ago
- Stream live plots to a matplotlib figure☆79Updated 4 months ago
- Cookbook for Crafting Good Code☆56Updated last year
- Support BM25+vector☆29Updated 3 months ago
- Chrome / Edge extension to turn arXiv papers into Markdown codes in one click.☆81Updated 5 months ago
- Customize your arXiv recommendation every day.☆121Updated 5 months ago
- An interactive web-based demonstration of fundamental tabular Reinforcement Learning (RL) algorithms in a simple grid world environment.☆71Updated 2 months ago
- A transformer-based multimodal model for music.☆29Updated last year
- Fetch arxiv data to LLM-friendly text☆124Updated 6 months ago
- An AI agent to control drones from your CLI☆130Updated 3 weeks ago
- 1000 entrepreneurial ideas from ycombinator, one entrepreneurial idea per line.☆55Updated 8 months ago
- Large Language Model in Action☆335Updated 7 months ago
- Countdown Game Distill&RL☆46Updated 4 months ago
- An AI-driven daily arXiv paper crawler, analyzer, and organizer tool, focusing on AIGC☆61Updated this week
- ☆166Updated 9 months ago
- ☆77Updated 4 months ago
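- Paper-reading tool: one-click screenshot plus AI translation, math formula support, and pinned multi-window management☆114Updated this week
- AI development tutorial based on Roo Cline + DeepSeek☆67Updated 5 months ago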
- PresentAgent: Multimodal Agent for Presentation Video Generation☆96Updated 3 weeks ago
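- This series aims to let readers understand and implement, from scratch, every component of a large language model along with the training and fine-tuning code, using only basic PyTorch and no other off-the-shelf external libraries; readers need only Python, PyTorch, and the most basic deep learning background.☆361Updated this week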
- A Deep Research agent from scratch☆201Updated 3 months ago
- 🧐 Open Deep Research Agent: Automated Knowledge Discovery with TextGAN☆92Updated last month
- 🎧 Pod-Helper: Real-time audio transcription and repair on consumer hardware☆77Updated last year
- Convert Everything to PDF☆161Updated 3 months ago
- ☆175Updated last year
- Python Implementation of MUVERA (Multi-Vector Retrieval via Fixed Dimensional Encodings)☆302Updated last month
- How to get the best results: Improve-Your-Prompt is a prompt for optimizing prompts☆40Updated 9 months ago
- Train a Language Model with GRPO to create a schedule from a list of events and priorities☆227Updated 4 months ago
- Curated resources for discovering, reading, and working with arXiv papers☆333Updated 2 months ago
- An abstraction library for building domain-specific intelligent agents based on Large Language Models (LLMs). LLMAgent provides a core ar…☆28Updated 4 months ago