My Solution and Notes for the Stanford CS336: LLM from scratch
☆236Mar 23, 2026Updated 2 months ago
Alternatives and similar repositories for Stanford-CS336
Users that are interested in Stanford-CS336 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆118Jan 18, 2026Updated 4 months ago
- ☆96Jul 20, 2025Updated 10 months ago
- Project of hardware course group in Tongji University☆15Dec 26, 2019Updated 6 years ago
- ☆67Dec 11, 2025Updated 5 months ago
- Implementation of my CS336 assignment1☆44Dec 23, 2025Updated 5 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A lightweight and highly extensible Agent framework☆23Apr 28, 2026Updated 3 weeks ago
- 同济大学软件学院大三上计算机网络实验报告☆21Jan 10, 2024Updated 2 years ago
- This is a repository of all the can-be-made-public notes and assignments during my undergraduate studies in Computer Science and Technolo…☆15Nov 13, 2023Updated 2 years ago
- 【ICME2025 Oral】Offical Pytorch Code for "Fraesormer: Learning Adaptive Sparse Transformer for Efficient Food Recognition"☆11Mar 21, 2025Updated last year
- 济勤大一下沈坚的高程作业。不支持复制粘贴,被查重了不怪我哈。希望帮助到扣头想不出作业的同学☆11Mar 17, 2025Updated last year
- 或许这里有作为同济大学软件学院机器智能的一位学生学业所需的所有东西☆19Aug 5, 2024Updated last year
- Simple template DAG scheduler in c++☆15Aug 13, 2020Updated 5 years ago
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- 同济大学软件学院2023年秋软件工程课程笔记☆17Jan 16, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A Guided Reinforcement Learning framework enhancing MLLM reasoning via process-level verification and collaborative rollout strategies.☆47May 4, 2026Updated 3 weeks ago
- ☆12Mar 18, 2024Updated 2 years ago
- pytorch☆10Apr 13, 2022Updated 4 years ago
- ArXiv daily dump and viewer using GitHub Actions - luvata.github.io/arxive☆14Updated this week
- 同济大学22级沈坚面向对象程序设计大作业☆11Aug 31, 2024Updated last year
- ☆10Oct 12, 2021Updated 4 years ago
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data☆13Jul 21, 2024Updated last year
- ☆21Aug 22, 2022Updated 3 years ago
- 数字信号处理大作业,语音处理☆10Mar 16, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Complete Reinforcement Learning Toolkit for Large Language Models!☆21Aug 2, 2025Updated 9 months ago
- ☆25Apr 5, 2025Updated last year
- 同济大学2022-2023第二学期计算机视觉课程作业☆14Jun 27, 2023Updated 2 years ago
- A courses related repo for Jinan University (JNU) International School (IS) CST major / 暨南大学国际学院计算机科学与技术专业相关课程仓库☆33Jul 11, 2025Updated 10 months ago
- 同济大学计算机组成原理课程设计☆19Nov 17, 2021Updated 4 years ago
- B.Tech Thesis Code for RLCaR: Deep Reinforcement Learning Framework for Optimal and Adaptive Cache Replacement☆13Oct 25, 2020Updated 5 years ago
- 李宏毅 (Hung-yi Lee) 机器学习 Machine Learning 2023 Spring☆14Dec 25, 2024Updated last year
- Artifact evaluation for HPCA'24 paper Lightening-Transformer: A Dynamically-operated Optically-interconnected Photonic Transformer Accele…☆11Mar 3, 2024Updated 2 years ago
- 同济大学现代密码学课程设计,包含AES算法,CBC模式,RSA加密解密,RSA签名,基于RSA的证书以及基于RSA和AES算法的文件加密系统。☆15Sep 1, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICLRW'26] EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation☆45Apr 21, 2026Updated last month
- gnuradio OFDM blocks for over the air communications. Tested with hardware (USRP's).☆18Oct 13, 2016Updated 9 years ago
- DNN_Partition辅助工具,用于对pytorch模型进行简单的性能分析以及支持模型切分☆14May 31, 2021Updated 4 years ago
- Using DDPG agent to control UAV system with energy efficiency☆16Jan 7, 2023Updated 3 years ago
- Unofficial Implementation of Selective Attention Transformer☆20Oct 31, 2024Updated last year
- Proactive Content Caching with Deep Learning☆14Oct 17, 2022Updated 3 years ago
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.☆21Jan 24, 2025Updated last year