☆523Apr 29, 2024Updated last year
Alternatives and similar repositories for Transformer-from-scratch
Users that are interested in Transformer-from-scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 《Reinforcement Learning》读书学习与视频分享笔记☆79Apr 1, 2025Updated last year
- 训练自己的中文 Embedding 模型☆30Jan 6, 2025Updated last year
- Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.☆287Mar 27, 2026Updated last week
- ☆13Oct 17, 2024Updated last year
- 中文翻译的 Hands-On-Large-Language-Models (hands-on-llms),动手学习大模型☆2,352Oct 19, 2025Updated 5 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- 复现大模型相关算法及一些学习记录☆3,236Mar 21, 2026Updated 2 weeks ago
- introduce AI infra knowledges. 人工智能系统基础架构知 识库☆16Jun 4, 2023Updated 2 years ago
- ☆20Apr 7, 2024Updated 2 years ago
- 从无名小卒到大模型(LLM)大英雄~ 欢迎关注后续!!!☆2,127Nov 22, 2025Updated 4 months ago
- A simple implementation of Llama 1, 2. Llama Architecture built from scratch using PyTorch all the models are built from scratch that inc…☆14May 6, 2024Updated last year
- 零实现 AlphaGo Zero☆17Nov 10, 2024Updated last year
- Zero-human, cold-start construction of long-chain agents in professional domains☆48Nov 10, 2025Updated 5 months ago
- Datastructure for data science☆23Apr 12, 2024Updated last year
- ☆13Sep 12, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 基于gradio的极简 ragflow API 聊天Web界面☆18Mar 31, 2025Updated last year
- Deep Dynamic Factor Models☆26Mar 30, 2026Updated last week
- ☆14Jul 13, 2025Updated 8 months ago
- 《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程☆29,557Mar 27, 2026Updated 2 weeks ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆39Aug 30, 2025Updated 7 months ago
- llama3 implementation one matrix multiplication at a time☆15,244May 23, 2024Updated last year
- 《大模型白盒子构建指南》:一个全手搓的Tiny-Universe☆4,698Feb 12, 2026Updated last month
- ☆24Dec 10, 2024Updated last year
- Llama3-Tutorial(XTuner、LMDeploy、OpenCompass)☆511May 10, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆28Jan 5, 2025Updated last year
- Code repo for MathAgent☆20Dec 15, 2023Updated 2 years ago
- NeurIPS 2022 paper, SubHypergraph Inductive Neural nEtwork☆19Aug 4, 2023Updated 2 years ago
- A curated collection of prompts for Grok Imagine by xAI☆26Oct 19, 2025Updated 5 months ago
- 🚀🚀 「大模型」2小时完全从0训练64M的小参数GPT!🌏 Train a 64M-parameter GPT from scratch in just 2h!☆45,668Updated this week
- Beyond Graph Convolution: Multimodal Recommendation with Topology-aware MLPs☆13Jan 28, 2025Updated last year
- This project implements the Titans architecture from the paper "Titans: Learning to Memorize at Test Time" for market data prediction.☆11Jan 19, 2025Updated last year
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step☆90,284Updated this week
- This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.☆498May 1, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆20Feb 24, 2025Updated last year
- [CIKM 2024] Do We Really Need Graph Convolution During Training? Light Post-Training Graph-ODE for Efficient Recommendation☆14Aug 11, 2024Updated last year
- 🚀 「大模型」1小时从0训练67M参数的视觉多模态VLM!🌏 Train a 67M-parameter VLM from scratch in just 1 hours!☆7,329Updated this week
- 这是一个用于与 RAGflow API 交互的 Python 客户端,支持数据集管理、文件管理、分块管理、聊天助手管理以及代理管理的完整功能。☆21Feb 21, 2025Updated last year
- 手把手带你实 战 Huggingface Transformers 课程视频同步更新在B站与YouTube☆3,902Jul 15, 2024Updated last year
- 🏆 SUCCESS-GS: Survey of Compactness and Compression for Efficient Static and Dynamic Gaussian Splatting☆28Updated this week
- 本项目是一个面向小白开发者的大模型应用开发教程,在线阅读地址:https://datawhalechina.github.io/llm-universe/☆12,479Feb 24, 2026Updated last month