My implementation of Stanford CS336 assignments.
☆240Mar 15, 2026Updated 3 months ago
Alternatives and similar repositories for cs336-assignments-answer
Users that are interested in cs336-assignments-answer are comparing it to the libraries listed below.
- 🏆🏆 "Large models": All in one & All from scratch. 🌍🌍 Collect and clean data, train a tokenizer, pretrain, SFT, GRPO!☆57Aug 12, 2025Updated 10 months ago
- Assignment 1 for Stanford CS336 - Language Modeling From Scratch☆78Jul 7, 2025Updated 11 months ago
- ☆55Nov 22, 2025Updated 6 months ago
- ☆97Jul 20, 2025Updated 10 months ago
- ☆123Jan 18, 2026Updated 5 months ago
- ☆17Dec 21, 2024Updated last year
- Repo for paper "Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability"☆108Apr 23, 2026Updated last month
- Notes and assignments from my CS336 studies☆909May 2, 2026Updated last month
- ☆18Nov 22, 2025Updated 6 months ago
- Stanford "Language Modeling from Scratch" CS336 Assignment 1 - personal implementation, for reference only☆46Jun 15, 2025Updated last year
- Un-official implementation of the Transformer Index for GEnerative Recommenders (TIGER) framework.☆13Jun 6, 2023Updated 3 years ago
- Student version of Assignment 2 for Stanford CS336 - Language Modeling From Scratch☆247May 1, 2026Updated last month
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆73May 29, 2026Updated 3 weeks ago
- NJU Software Analysis Tai-e☆13Dec 10, 2022Updated 3 years ago
- ☆15Mar 18, 2026Updated 3 months ago
- my implementation of NJU ICS PA 2021☆18Sep 12, 2022Updated 3 years ago
- Official implementation: Population Aware Diffusion for Time Series Generation (AAAI-25)☆17Sep 1, 2025Updated 9 months ago
- ☆112Jan 23, 2026Updated 4 months ago
- [Patterns] MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning☆81Mar 10, 2026Updated 3 months ago
- Official code for paper "GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable R…☆64Mar 29, 2026Updated 2 months ago
- ☆14Oct 19, 2025Updated 8 months ago
- ☆3,261May 28, 2026Updated 3 weeks ago
- ☆24Oct 13, 2024Updated last year
- Prototype for a Category Theory-based GNN Library☆15Apr 20, 2022Updated 4 years ago
- ☆11Apr 12, 2024Updated 2 years ago
- A simple shell imitating the Linux shell, written in C (Linux) for BUAA-Unix-Lecture 2021☆15Aug 24, 2021Updated 4 years ago
- Nanjing University, Introduction to Machine Learning☆21Jun 6, 2019Updated 7 years ago
- homework answer for UCB cs285 deepRL☆75Dec 26, 2024Updated last year
- ☆13Oct 5, 2021Updated 4 years ago
- CTR-Prediction☆14Aug 7, 2019Updated 6 years ago
- [NeurIPS 2023] Focus Your Attention when Few-Shot Classification☆17Feb 26, 2024Updated 2 years ago
- Code for Multi-Aspect Cross-modal Quantization for Generative Recommendation. (AAAI 2026 Oral)☆43Dec 9, 2025Updated 6 months ago
- Use stable diffusion to outpaint around an image and uncrop it☆21Feb 8, 2023Updated 3 years ago
- My solutions for CS61A and all the resources I have☆171Feb 7, 2023Updated 3 years ago
- ☆12Jun 26, 2024Updated last year
- [NeurIPS 2025] MedAgentBoard: Benchmarking Multi-Agent Collaboration with Conventional Methods for Diverse Medical Tasks☆58Mar 13, 2026Updated 3 months ago
- A simple implementation of the LEGv8 instruction set using Verilog HDL.☆12May 8, 2022Updated 4 years ago
- [NeurIPS 2024] Official Implementation of "SDformer: Similarity-driven Discrete Transformer For Time Series Generation"☆16May 23, 2025Updated last year
- Awesome Generative Recommendation papers primarily focused on industry-level applications.☆224Jun 1, 2026Updated 2 weeks ago