Stanford "Language Modeling from Scratch" CS336 Assignment1 - 斯坦福大学 CS336 课程作业1 个人实现,仅供参考
☆43Jun 15, 2025Updated 9 months ago
Alternatives and similar repositories for cs336-assignment1-basics
Users that are interested in cs336-assignment1-basics are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 本项目是我在学习 CS336 课程过程中整理的学习笔记 This project is a collection of study notes I compiled while taking the CS336 course.☆24Nov 1, 2025Updated 4 months ago
- [IEEE TKDE] A LLM-based Recommender System with user&item Tokenizers and a generative retrieval paradigm.☆26Mar 11, 2026Updated 2 weeks ago
- An agent with multiple CUHKSZ campus systems connected.☆17Dec 12, 2024Updated last year
- 🚀全流程自己训练一个VLA 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!☆29Oct 16, 2025Updated 5 months ago
- Implementation based on pytorch for DIN recommendation algorithm☆21Jul 30, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A Pandas-like DataFrame in c++☆20Aug 3, 2019Updated 6 years ago
- Implementation codes for NeurIPS23 paper "Spectral Invariant Learning for Dynamic Graphs under Distribution Shifts"☆14Mar 19, 2024Updated 2 years ago
- Pytorch routines for (Ker)nel (Mac)hines☆11Oct 10, 2025Updated 5 months ago
- 🐲 LLVM-based Kaleidoscope language compiler ✨ 基于 LLVM 的 Kaleidoscope 编译器☆12Dec 16, 2022Updated 3 years ago
- A project designed to build and render a full Minecraft crafting tree.☆10Aug 10, 2021Updated 4 years ago
- Official code for the paper: Scaling Transformers for Discriminative Recommendation via Generative Pretraining☆26Sep 1, 2025Updated 6 months ago
- ☆11Mar 8, 2024Updated 2 years ago
- ☆11Jun 20, 2023Updated 2 years ago
- A Jupyter-style custom node for executing Python code and plotting within ComfyUI workflows.☆36Mar 18, 2026Updated last week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Analyzing LLM Alignment via Token distribution shift☆17Jan 26, 2024Updated 2 years ago
- NeurIPS22 "RankFeat: Rank-1 Feature Removal for Out-of-distribution Detection" and T-PAMI Extension☆20Feb 21, 2025Updated last year
- ☆17Feb 4, 2025Updated last year
- Tutorials for MATH 4432 Statistical Machine Learning, HKUST, Fall 2022☆11Sep 17, 2024Updated last year
- This is a repository for RM2021 Software tutorial☆11Nov 4, 2020Updated 5 years ago
- [ICLR 2025] This repository contains the code to reproduce the results from our paper From Sparse Dependence to Sparse Attention: Unveili…☆12Mar 7, 2025Updated last year
- Experiments for "A Closer Look at In-Context Learning under Distribution Shifts"☆19May 29, 2023Updated 2 years ago
- A template project to both illustrate and serve as an example for plugin creations on top of the manim.☆20Apr 30, 2021Updated 4 years ago
- ☆19Dec 12, 2023Updated 2 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Curse-of-memory phenomenon of RNNs in sequence modelling☆19May 8, 2025Updated 10 months ago
- 《计算机程序的构造和解释》(原书第二版)习题解答,在线阅读地址:https://relph1119.github.io/sicp-solutions-manual☆13May 28, 2021Updated 4 years ago
- [ICML 2023] Taxonomy-Structured Domain Adaptation☆12Oct 6, 2023Updated 2 years ago
- Code for ICLR 2023 Harnessing Out-Of-Distribution Examples via Augmenting Content and Style☆13Jul 3, 2023Updated 2 years ago
- CSE 351: The Hardware/Software Interface (taught by Luis Ceze)☆16May 22, 2014Updated 11 years ago
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf☆21Jul 29, 2024Updated last year
- Diffusion-based Negative Sampling on Graphs for Link Prediction☆14Feb 13, 2024Updated 2 years ago
- My solution to assignments for Berkeley CS 285: Deep Reinforcement Learning, Decision Making, and Control.☆17Mar 19, 2025Updated last year
- This is the notebooks for videos in my Bilibili Channel (https://space.bilibili.com/32773300?spm_id_from=333.1007.0.0)☆31Nov 6, 2025Updated 4 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆31Nov 30, 2025Updated 3 months ago
- 微信刷票实例☆14Aug 5, 2019Updated 6 years ago
- ☆23Apr 16, 2024Updated last year
- Implementation of approximate free-energy minimization in PyTorch☆21Oct 16, 2021Updated 4 years ago
- source code of AAAI 2024 paper "Graph Invariant Learning with Subgraph Co-mixup for Out-Of-Distribution Generalization".☆18Apr 29, 2024Updated last year
- 龙大生存手册☆38Jan 20, 2025Updated last year
- Official implementation of MARIO: Model Agnostic Recipe for Improving OOD Generalization of Graph Contrastive Learning☆19Jan 27, 2024Updated 2 years ago