尝试自己从头写一个LLM,参考llama和nanogpt
☆68Apr 27, 2024Updated 2 years ago
Alternatives and similar repositories for my_llm
Users that are interested in my_llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Retriever-0.1B☆96Jun 6, 2024Updated last year
- LLM对话生成工具☆33Apr 12, 2024Updated 2 years ago
- 2025.01:从零到一实现了一个多模态大模型,并命名为Reyes(睿视),R:睿,eyes:眼。Reyes的参数量为8B,视觉编码器使用的是InternViT-300M-448px-V2_5,语言模型侧使用的是Qwen2.5-7B-Instruct,Reyes也通过一个两…☆33Feb 10, 2026Updated 2 months ago
- ☆29Jan 5, 2025Updated last year
- Automatically exported from code.google.com/p/tmitter☆10Sep 17, 2015Updated 10 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆15Jun 6, 2023Updated 2 years ago
- GraphRAG 的中文优化版本☆23Dec 19, 2025Updated 4 months ago
- NUST-API集合☆10Oct 29, 2018Updated 7 years ago
- 一个炫酷的脚本语言 An Amazing Script Language☆20Jul 12, 2024Updated last year
- https://bbuf.github.io/gpu-glossary-zh/☆27Nov 7, 2025Updated 5 months ago
- 想要从零开始训练一个中文的mini大语言模型,可以进行基本的对话,模型大小根据手头的机器决定☆65Aug 14, 2024Updated last year
- 《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合中国宝宝的部署教程☆11Jun 8, 2024Updated last year
- LLM101n: Let's build a Storyteller 中文版☆138Aug 15, 2024Updated last year
- ☆19Aug 9, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆16Jul 29, 2025Updated 9 months ago
- hyperscan using dpdk☆13Jul 15, 2018Updated 7 years ago
- Papers of "A Survey on Multimodal LLMs from the Perspective of Input-Output Space Extension"☆17Feb 4, 2026Updated 3 months ago
- Split a string into a char array by a given delimiter☆14Apr 1, 2016Updated 10 years ago
- A LLM Paper note list.☆21Apr 6, 2024Updated 2 years ago
- network time protocol client☆18Dec 15, 2010Updated 15 years ago
- ☆64Updated this week
- ☆14Jan 3, 2023Updated 3 years ago
- Building an RPG with Unity 2018, published by Packt☆17Jan 18, 2023Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- 本仓库是关于大模型面试中常见面试试题和面试经验的整理。这里收集了各类与大模型相关的面试题目,并提供详细的解答和分析。本仓库由上海交大交影社区维护☆125Aug 23, 2024Updated last year
- 使用 Bert 进行文本分类☆20Dec 7, 2021Updated 4 years ago
- 用Numpy复现可训练的LLaMa3☆34Jul 5, 2024Updated last year
- ☆11Feb 9, 2022Updated 4 years ago
- An exact algorithm for the maximum clique problem (MCP) which improves over state-of-the-art approaches in some cases by orders of magnit…☆15Nov 15, 2025Updated 5 months ago
- Vue+Element+Express全栈开发☆16Jan 4, 2023Updated 3 years ago
- 埃及象形文字聖書體MdC轉寫☆12May 3, 2020Updated 6 years ago
- Achieve your exclusive DeepResearch.☆26Apr 25, 2025Updated last year
- Creates a Visio flowchart using Python☆20Sep 11, 2015Updated 10 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 通过实验对比LLM推理中Prefill和Decoding阶段的吞吐量差异,揭示性能瓶颈,解释PD分离优化技术的原理。包含CUDA和Apple MPS (M系列芯片) 的测试脚本。☆21May 22, 2025Updated 11 months ago
- 一些有关长时间序列预测、CV和机器学习的论文中译版☆45Jan 16, 2025Updated last year
- iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models (ICLR2026)☆22Mar 29, 2026Updated last month
- Executable UML tools (xml schema, java model compiler, java + javascript model viewer) based on miUML metamodels☆20Sep 18, 2024Updated last year
- ☆16Jun 15, 2017Updated 8 years ago
- StrongSORT with Selective Feature Extraction Mechanism☆15Sep 25, 2024Updated last year
- Proofs written in Lean4 for the core katydid validation algorithm☆18Sep 17, 2025Updated 7 months ago