Stanford "Language Modeling from Scratch" CS336 Assignment1 - Personal implementation of Stanford University CS336 course assignment 1, for reference only
☆45Jun 15, 2025Updated 11 months ago
Alternatives and similar repositories for cs336-assignment1-basics
Users that are interested in cs336-assignment1-basics are comparing it to the libraries listed below.
- [IEEE TKDE] An LLM-based Recommender System with user&item Tokenizers and a generative retrieval paradigm.☆26Mar 11, 2026Updated 2 months ago
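- Study notes on the main algorithms for sequential recommendation, multi-task recommendation, cross-domain recommendation, cold start, and related directions in recommendation and advertising.☆16May 25, 2022Updated 4 years ago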
- Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch☆1,650Apr 7, 2026Updated last month
- This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course translated in Chinese.☆10Jan 16, 2024Updated 2 years ago
- Implementation based on pytorch for DIN recommendation algorithm☆22Jul 30, 2020Updated 5 years ago
- Official pytorch implementation of "MUSE: A Simple Yet Effective Multimodal Search-Based Framework for Lifelong User Interest Modeling"☆48Jan 12, 2026Updated 4 months ago
- ☆13Aug 13, 2025Updated 9 months ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆11Dec 30, 2024Updated last year
- Implementation codes for NeurIPS23 paper "Spectral Invariant Learning for Dynamic Graphs under Distribution Shifts"☆14Mar 19, 2024Updated 2 years ago
- Automate dating apps with AI☆23Jan 18, 2024Updated 2 years ago
- Procedural data generators suite for synthetic pretraining and formal reasoning☆40Updated this week
- Pytorch routines for (Ker)nel (Mac)hines☆12Oct 10, 2025Updated 7 months ago
- Simple MoE - Day 17 of 365 Days of Repos☆19Apr 21, 2026Updated last month
- ICLR 2026☆42May 12, 2026Updated 2 weeks ago
- ☆23Jun 28, 2025Updated 10 months ago
- 🐲 LLVM-based Kaleidoscope language compiler ✨☆12Dec 16, 2022Updated 3 years ago
- A project designed to build and render a full Minecraft crafting tree.☆10Aug 10, 2021Updated 4 years ago
- ☆66Mar 4, 2026Updated 2 months ago
- Open source code for MobiPurpose project☆13Mar 25, 2025Updated last year
- ☆11Mar 8, 2024Updated 2 years ago
- Official code for the paper: Scaling Transformers for Discriminative Recommendation via Generative Pretraining☆29Sep 1, 2025Updated 8 months ago
- Analyzing LLM Alignment via Token distribution shift☆18Jan 26, 2024Updated 2 years ago
- Code for Multi-Aspect Cross-modal Quantization for Generative Recommendation. (AAAI 2026 Oral)☆39Dec 9, 2025Updated 5 months ago
- WIP: Unofficial implementation of diffusion autoencoders, using pytorch☆11Feb 15, 2023Updated 3 years ago
- NeurIPS22 "RankFeat: Rank-1 Feature Removal for Out-of-distribution Detection" and T-PAMI Extension☆20Feb 21, 2025Updated last year
- ☆16Feb 4, 2025Updated last year
- This is a repository for RM2021 Software tutorial☆11Nov 4, 2020Updated 5 years ago
- [ICLR 2025] This repository contains the code to reproduce the results from our paper From Sparse Dependence to Sparse Attention: Unveili…☆12Mar 7, 2025Updated last year
- Experiments for "A Closer Look at In-Context Learning under Distribution Shifts"☆18May 29, 2023Updated 2 years ago
- A template project to both illustrate and serve as an example for plugin creations on top of the manim.☆20Apr 30, 2021Updated 5 years ago
- Code and data release for CCS'2022 paper "Understanding IoT Security from a Market-Scale Perspective"☆12Apr 13, 2023Updated 3 years ago
- C++-Animation-(Standard-Template-Library)-Engine, or CASTLE for short, is a C++ plotting and animation engine created by BiliBili uploader …☆11Jan 17, 2021Updated 5 years ago
- ☆19Dec 12, 2023Updated 2 years ago
- Advanced Programming - HW2☆27Mar 11, 2022Updated 4 years ago
- Code for ICLR 2023 Harnessing Out-Of-Distribution Examples via Augmenting Content and Style☆13Jul 3, 2023Updated 2 years ago
- [ICML 2023] Taxonomy-Structured Domain Adaptation☆12Oct 6, 2023Updated 2 years ago
- A curated list of papers on graph transfer learning (GTL).☆18Oct 23, 2023Updated 2 years ago
- CSE 351: The Hardware/Software Interface (taught by Luis Ceze)☆16May 22, 2014Updated 12 years ago
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf☆21Jul 29, 2024Updated last year