An attempt to write an LLM from scratch, with reference to llama and nanogpt
☆69 · Apr 27, 2024 · Updated 2 years ago
Alternatives and similar repositories for my_llm
Users that are interested in my_llm are comparing it to the libraries listed below.
- Retriever-0.1B · ☆96 · Jun 6, 2024 · Updated 2 years ago
- 2025.01: Implemented a multimodal large model from scratch, named Reyes (睿视; R for 睿, "wise", and eyes for 眼, "eye"). Reyes has 8B parameters; the vision encoder is InternViT-300M-448px-V2_5 and the language model is Qwen2.5-7B-Instruct. Reyes also uses a two… · ☆34 · Feb 10, 2026 · Updated 4 months ago
- ☆29 · Jan 5, 2025 · Updated last year
- ☆20 · May 28, 2025 · Updated last year
- Automatically exported from code.google.com/p/tmitter · ☆10 · Sep 17, 2015 · Updated 10 years ago
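- Dialogue corpus drawn from 120 Chinese web novels · ☆18 · Feb 14, 2017 · Updated 9 years ago
- A Chinese-optimized version of GraphRAG · ☆23 · Dec 19, 2025 · Updated 5 months ago
- NUST-API collection · ☆10 · Oct 29, 2018 · Updated 7 years ago
- A pointer-generator network based on the Bart language model, for Chinese grammatical error correction · ☆16 · Sep 8, 2022 · Updated 3 years ago
- Nano-BERT is a straightforward, lightweight and comprehensible custom implementation of BERT, inspired by the foundational "Attention is … · ☆21 · Oct 19, 2023 · Updated 2 years ago
- A cool scripting language (An Amazing Script Language) · ☆20 · Jul 12, 2024 · Updated last year
- Training a Chinese mini large language model from scratch that can hold basic conversations; the model size depends on the hardware at hand · ☆65 · Aug 14, 2024 · Updated last year
- OpenFlow protocol endpoint written in C++ · ☆10 · Updated this week
- "Open-Source LLM Usage Guide": quickly deploy open-source large models on Linux; a deployment tutorial aimed at beginners in China · ☆11 · Jun 8, 2024 · Updated 2 years ago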
- ☆19 · Aug 9, 2024 · Updated last year
- LLM101n: Let's build a Storyteller (Chinese edition) · ☆138 · Aug 15, 2024 · Updated last year
- ☆16 · Jul 29, 2025 · Updated 10 months ago
- Building Agentic-RAG with langraph · ☆23 · Jul 30, 2025 · Updated 10 months ago
- [ICRA'24] Influence of Camera-LiDAR Configuration on 3D Object Detection for Autonomous Driving · ☆21 · Sep 14, 2024 · Updated last year
- ChatGPT front-end page template: AI Q&A and AI chat (with context); download and run! · ☆20 · Feb 21, 2023 · Updated 3 years ago
- A rework of the langchain team's open-deep-research, combined with chainlit for convenient use by developers in China · ☆22 · Oct 9, 2025 · Updated 8 months ago
- An LLM paper note list. · ☆20 · Apr 6, 2024 · Updated 2 years ago
- network time protocol client · ☆18 · Dec 15, 2010 · Updated 15 years ago
- ☆65 · Updated this week
- ☆26 · Oct 23, 2019 · Updated 6 years ago
- A sample using spring-boot-spark · ☆11 · Aug 3, 2018 · Updated 7 years ago
- [NeurIPS 2025 🔥] Official implementation for "Don't Just Chase 'Highlighted Tokens' in MLLMs: Revisiting Visual Holistic Context Retenti… · ☆64 · Mar 5, 2026 · Updated 3 months ago
- Live clone of https://gitlab.com/libeigen/eigen. · ☆13 · Updated this week
- Building an RPG with Unity 2018, published by Packt · ☆17 · Jan 18, 2023 · Updated 3 years ago
- This repository compiles common interview questions and interview experience for large-model roles. It collects a wide range of interview questions related to large models and provides detailed answers and analysis. Maintained by the Shanghai Jiao Tong University 交影 community · ☆130 · Aug 23, 2024 · Updated last year
- ☆15 · Oct 23, 2023 · Updated 2 years ago
- Text classification with Bert · ☆20 · Dec 7, 2021 · Updated 4 years ago
- A trainable LLaMa3 reimplemented in Numpy · ☆34 · Jul 5, 2024 · Updated last year
- 慕课网 (imooc): Building scalable RESTful APIs with Python Flask · ☆16 · Jun 20, 2018 · Updated 7 years ago
- Achieve your exclusive DeepResearch. · ☆28 · Apr 25, 2025 · Updated last year
- iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models (ICLR2026) · ☆22 · Mar 29, 2026 · Updated 2 months ago
- A vLLM plugin built on the FlagOS unified multi-chip backend. · ☆56 · Updated this week
- ☆16 · Jun 15, 2017 · Updated 9 years ago
- 👀 "Large model": Train a 65M-parameter vision multimodal VLM from scratch in just 2h! · ☆8,126 · May 19, 2026 · Updated 3 weeks ago