YYZhang2025 / Stanford-CS336View external linksLinks
My Solution and Notes for the Stanford CS336: LLM from scratch
☆167Jan 26, 2026Updated 3 weeks ago
Alternatives and similar repositories for Stanford-CS336
Users that are interested in Stanford-CS336 are comparing it to the libraries listed below
Sorting:
- ☆92Jul 20, 2025Updated 6 months ago
- ☆97Jan 18, 2026Updated 3 weeks ago
- Code for "RSQ: Learning from Important Tokens Leads to Better Quantized LLMs"☆20Jun 11, 2025Updated 8 months ago
- Project of hardware course group in Tongji University☆16Dec 26, 2019Updated 6 years ago
- A Guided Reinforcement Learning framework enhancing MLLM reasoning via process-level verification and collaborative rollout strategies.☆41Updated this week
- 电子科技大学实验报告latex模板☆10Dec 12, 2019Updated 6 years ago
- 同济23数字逻辑张冬冬老师作业☆13Nov 23, 2024Updated last year
- Super mario running on FPGA☆12Mar 14, 2019Updated 6 years ago
- pytorch☆10Apr 13, 2022Updated 3 years ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 5 months ago
- Biologically-Constrained Graphs for Global Connectomics Reconstruction☆11Feb 9, 2021Updated 5 years ago
- Repo allows users to test different DL archictectures when applied to time series forecasting of weather data (TCN, LSTM, BiLSTM, GRU, Bi…☆19Mar 14, 2025Updated 11 months ago
- ☆11Jan 21, 2024Updated 2 years ago
- EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation☆27Jul 30, 2025Updated 6 months ago
- ☆13Feb 2, 2023Updated 3 years ago
- 【ICME2025 Oral】 Offical Pytorch Code for "Learning Dual-Domain Multi-Scale Representations for Single Image Deraining"☆16Mar 21, 2025Updated 10 months ago
- B.Tech Thesis Code for RLCaR: Deep Reinforcement Learning Framework for Optimal and Adaptive Cache Replacement☆12Oct 25, 2020Updated 5 years ago
- 文言文信息抽取(实体识别+关系抽取)☆10Feb 24, 2023Updated 2 years ago
- A MATLAB toolbox for simulating multi-channel communications.☆16Jun 4, 2025Updated 8 months ago
- Multi-Figurative Language Generation (COLING 2022)☆12Jan 30, 2023Updated 3 years ago
- ☆21Apr 5, 2025Updated 10 months ago
- This is a repository of all the can-be-made-public notes and assignments during my undergraduate studies in Computer Science and Technolo…☆16Nov 13, 2023Updated 2 years ago
- personal info☆10Mar 23, 2024Updated last year
- Github Repo for OATS: Outlier-Aware Pruning through Sparse and Low Rank Decomposition☆17Apr 16, 2025Updated 10 months ago
- 同济大学22级沈坚面向对象程序设计大作业☆14Aug 31, 2024Updated last year
- ☆13Jan 15, 2025Updated last year
- Artifact evaluation for HPCA'24 paper Lightening-Transformer: A Dynamically-operated Optically-interconnected Photonic Transformer Accele…☆11Mar 3, 2024Updated last year
- ☆15Nov 7, 2024Updated last year
- Paper-Agent 是一个面向科研人员和学生的智能论文检索与调研工具。项目基于多智能体协作架构(AutoGen + LangGraph),通过自然语言处理(NLP)、自动化搜索和知识库构建,帮助用户高效查找学术论文、 分析文献内容,并进行论文调研。Paper-Agent …☆39Feb 5, 2026Updated last week
- Code for the paper "Automated Generation of Hospital Discharge Summaries Using Clinical Guidelines and Large Language Models"☆11May 3, 2024Updated last year
- LLM implementation one matrix multiplication at a time☆13Aug 8, 2024Updated last year
- A Mobile edge computing server placement algorithm, written from scratch for 5g server placement depending upon various KPIs across a ar…☆12Sep 14, 2022Updated 3 years ago
- An OFDM modulation commutation system on Android phones using sound wave☆17May 14, 2022Updated 3 years ago
- ☆14Dec 21, 2024Updated last year
- Implemented a basic 802.11 OFDM PHY layer, including packet detection, synchronization, channel estimation, modulation and demodulation☆14Nov 4, 2023Updated 2 years ago
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…☆17Dec 17, 2025Updated last month
- A custom Huggingface trainer which supports logging auxiliary losses returned by your model☆15Jul 27, 2025Updated 6 months ago
- ☆11Oct 8, 2022Updated 3 years ago
- Code for paper "Joint Architecture Design and Workload Partitioning for DNN Inference on Industrial IoT Clusters"☆15Aug 22, 2025Updated 5 months ago