My Solution and Notes for the Stanford CS336: LLM from scratch
☆209Mar 23, 2026Updated 3 weeks ago
Alternatives and similar repositories for Stanford-CS336
Users that are interested in Stanford-CS336 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆115Jan 18, 2026Updated 2 months ago
- ☆97Jul 20, 2025Updated 8 months ago
- Project of hardware course group in Tongji University☆15Dec 26, 2019Updated 6 years ago
- Super mario running on FPGA☆12Mar 14, 2019Updated 7 years ago
- LLM implementation one matrix multiplication at a time☆13Aug 8, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 操作系统课设自用 xv6-2021 。☆10Jul 13, 2022Updated 3 years ago
- This is a repository of all the can-be-made-public notes and assignments during my undergraduate studies in Computer Science and Technolo…☆16Nov 13, 2023Updated 2 years ago
- ☆21Apr 5, 2025Updated last year
- ☆21Jan 9, 2023Updated 3 years ago
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- ☆13Feb 2, 2023Updated 3 years ago
- ☆10Oct 12, 2021Updated 4 years ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 7 months ago
- CMU BusTub Relational Database Management System (Fall 2023)☆11Dec 13, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [ICLRW'26] EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation☆33Mar 24, 2026Updated 3 weeks ago
- 💡 受到TJ-CSCCG / TJCS-Course的启发,该repo用于存放(大家贡献的)同济大学生物相关专业的课程资源。准备加入部分科目教材及讲义、笔记、报告模板、实验Protocol等内容。期待更多课程加入……☆18Feb 16, 2024Updated 2 years ago
- Complete Reinforcement Learning Toolkit for Large Language Models!☆21Aug 2, 2025Updated 8 months ago
- Mixture of Lora Experts☆10Apr 7, 2024Updated 2 years ago
- ☆13Jun 15, 2021Updated 4 years ago
- 同济大学2022-2023第二学期计算机视觉课程作业☆14Jun 27, 2023Updated 2 years ago
- Official implementation for Text Generation Beyond Discrete Token Sampling☆24Aug 11, 2025Updated 8 months ago
- 同济大学计算机组成原理课程设计☆19Nov 17, 2021Updated 4 years ago
- WeKnora‑pro是基于原始 WeKnora 的二次开发版本,核心在于提升文档解析能力。 主要改进:1. 支持扫描件通过 (CPU/GPU 自动优化)进行 OCR 与表格提取;且兼容WeKnora多模态增加 2. 文档大小上限提升至 300 MB☆46Oct 29, 2025Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- B.Tech Thesis Code for RLCaR: Deep Reinforcement Learning Framework for Optimal and Adaptive Cache Replacement☆12Oct 25, 2020Updated 5 years ago
- This is the repo of "RR-Compound: RDMA-Fused gRPC for Low Latency and High Throughput With an Easy Interface" published in TPDS☆29Mar 12, 2025Updated last year
- demo natural language video db using CLIP☆28Aug 7, 2024Updated last year
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …☆65Jan 28, 2026Updated 2 months ago
- An Ultra-Long Output Reinforcement Learning Approach☆23Jul 31, 2025Updated 8 months ago
- ☆14Dec 21, 2024Updated last year
- 同济大学软件学院离散数学tjf老师班大作业,附项目说明文档等☆17Dec 9, 2022Updated 3 years ago
- Code for the paper "Automated Generation of Hospital Discharge Summaries Using Clinical Guidelines and Large Language Models"☆11May 3, 2024Updated last year
- [ICCV'25] The official implementation of "PseudoMapTrainer: Learning Online Mapping without HD Maps" by Löwens et al.☆49Aug 27, 2025Updated 7 months ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- DNN_Partition辅助工具,用于对pytorch模型进行简单的性能分析以及支持模型切分☆14May 31, 2021Updated 4 years ago
- ☆13Apr 1, 2026Updated 2 weeks ago
- Unofficial Implementation of Selective Attention Transformer☆21Oct 31, 2024Updated last year
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.☆21Jan 24, 2025Updated last year
- ☆20May 1, 2025Updated 11 months ago
- personal info☆11Mar 23, 2024Updated 2 years ago
- ☆27Jul 18, 2025Updated 8 months ago