My Solution and Notes for the Stanford CS336: LLM from scratch
☆194Mar 23, 2026Updated this week
Alternatives and similar repositories for Stanford-CS336
Users that are interested in Stanford-CS336 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆110Jan 18, 2026Updated 2 months ago
- ☆96Jul 20, 2025Updated 8 months ago
- Code for "RSQ: Learning from Important Tokens Leads to Better Quantized LLMs"☆21Mar 17, 2026Updated last week
- Project of hardware course group in Tongji University☆16Dec 26, 2019Updated 6 years ago
- Super mario running on FPGA☆12Mar 14, 2019Updated 7 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆61Dec 11, 2025Updated 3 months ago
- 同济23数字逻辑张冬冬老师作业☆13Nov 23, 2024Updated last year
- Source code and data for ADEPT: A DEbiasing PrompT Framework (AAAI-23).☆15Dec 13, 2024Updated last year
- 操作系统课设自用 xv6-2021 。☆10Jul 13, 2022Updated 3 years ago
- 同济大学软件学院大三上计算机网络实验报告☆19Jan 10, 2024Updated 2 years ago
- C++ RPC based on RDMA☆13Sep 12, 2023Updated 2 years ago
- ☆15Sep 23, 2024Updated last year
- [ICLR/AAAI 2026] Open-Source LLM-Based Data Analysis Agents☆73Jan 26, 2026Updated 2 months ago
- ☆21Apr 5, 2025Updated 11 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 或许这里有作为同济大学软件学院机器智能的一位学生学业所需的所有东西☆18Aug 5, 2024Updated last year
- ☆21Jan 9, 2023Updated 3 years ago
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- ☆13Feb 2, 2023Updated 3 years ago
- 同济大学软件学院2023年秋软件工程课程笔记☆17Jan 16, 2024Updated 2 years ago
- pytorch☆10Apr 13, 2022Updated 3 years ago
- 同济大学22级沈坚面向对象程序设计大作业☆14Aug 31, 2024Updated last year
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data☆14Jul 21, 2024Updated last year
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆24Mar 4, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆12Jan 21, 2024Updated 2 years ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 6 months ago
- [ICLRW'26] EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation☆29Mar 16, 2026Updated last week
- 💡 受到TJ-CSCCG / TJCS-Course的启发,该repo用于存放(大家贡献的)同济大学生物相关专业的课程资源。准备加入部分科目教材及讲义、笔记、报告模板、实验Protocol等内容。期待更多课程加入……☆18Feb 16, 2024Updated 2 years ago
- Mixture of Lora Experts☆10Apr 7, 2024Updated last year
- Complete Reinforcement Learning Toolkit for Large Language Models!☆21Aug 2, 2025Updated 7 months ago
- ☆13Jun 15, 2021Updated 4 years ago
- Official implementation for Text Generation Beyond Discrete Token Sampling☆24Aug 11, 2025Updated 7 months ago
- 同济大学计算机组成原理课程设计☆19Nov 17, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This is the repo of "RR-Compound: RDMA-Fused gRPC for Low Latency and High Throughput With an Easy Interface" published in TPDS☆27Mar 12, 2025Updated last year
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …☆64Jan 28, 2026Updated last month
- Repo allows users to test different DL archictectures when applied to time series forecasting of weather data (TCN, LSTM, BiLSTM, GRU, Bi…☆20Mar 14, 2025Updated last year
- 同济大学现代密码学课程设计,包含AES算法,CBC模式,RSA加密解密,RSA签名,基于RSA的证书以及基于RSA和AES算法的文件加密系统。☆15Sep 1, 2022Updated 3 years ago
- An Ultra-Long Output Reinforcement Learning Approach☆23Jul 31, 2025Updated 7 months ago
- ☆14Dec 21, 2024Updated last year
- 同济大学软件学院离散数学tjf老师班大作业,附项目说明文档等☆17Dec 9, 2022Updated 3 years ago