Assignment 1 for Stanford CS336 - Language Modeling From Scratch
☆77Jul 7, 2025Updated 8 months ago
Alternatives and similar repositories for stanford-cs336-a1
Users that are interested in stanford-cs336-a1 are comparing it to the libraries listed below
Sorting:
- 国科大雁栖湖校区2024~2025年课程资料,包括强化学习、智能计算系统、模式识别、矩阵分析与应用、人工智能原理与算法、自然语言处理☆36Sep 22, 2025Updated 5 months ago
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆65Sep 27, 2025Updated 5 months ago
- 5G通信资源分配算法☆11Jun 19, 2020Updated 5 years ago
- Automate dating apps with AI☆19Jan 18, 2024Updated 2 years ago
- Redefining Video Management with power of SQL☆11Oct 15, 2023Updated 2 years ago
- Code for "Learning Harmonic Molecular Representations on Riemannian Manifold", ICLR, 2023☆10Mar 23, 2023Updated 2 years ago
- ☆13Dec 1, 2025Updated 3 months ago
- Implementation codes for NeurIPS23 paper "Spectral Invariant Learning for Dynamic Graphs under Distribution Shifts"☆13Mar 19, 2024Updated last year
- Official eval code for ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation☆27Dec 12, 2025Updated 2 months ago
- Code for Research Project TLDR☆25Jul 28, 2025Updated 7 months ago
- ☆14Nov 11, 2024Updated last year
- ☆16Nov 12, 2025Updated 3 months ago
- ☆25Updated this week
- Codes and data for KDD 2024 Research Track paper "ProCom: A Few-shot Targeted Community Detection Algorithm"☆11Aug 15, 2024Updated last year
- awesome SAE papers☆74May 24, 2025Updated 9 months ago
- Enformer Celltyping is a tensorflow, multi-headed attention based model that incorporates distal effects of Deoxyribonucleic Acid (DNA) i…☆16Jun 25, 2025Updated 8 months ago
- ☆17Jun 26, 2025Updated 8 months ago
- Diffusion-based Negative Sampling on Graphs for Link Prediction☆13Feb 13, 2024Updated 2 years ago
- ☆14May 30, 2023Updated 2 years ago
- Code release for "Generative Modeling of Weights: Generalization or Memorization?"☆19Jun 10, 2025Updated 8 months ago
- Implementation of Phenotype prediction from single-cell RNA-seq data using attention-based neural networks (Bioinformatics 2024).☆13Jul 15, 2024Updated last year
- Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries☆34Nov 19, 2025Updated 3 months ago
- translate skyzh/mini-lsm to go version☆10Jun 7, 2023Updated 2 years ago
- [EMNLP 2023] Generative Emotion Cause Triplet Extraction in Conversations with Commonsense Knowledge☆14Jan 5, 2024Updated 2 years ago
- LLMs Learn Task Heuristics from Demonstrations: A Heuristic-Driven Prompting Strategy for Document-Level Event Argument Extraction (ACL 2…☆14Aug 12, 2024Updated last year
- ☆11Jan 9, 2025Updated last year
- [NeurIPS 2023] "Understanding the Limitations of Deep Models for Molecular Property Prediction: Insights and Solutions"☆12Jan 26, 2024Updated 2 years ago
- hustpa ics2019☆10Jul 11, 2022Updated 3 years ago
- [Paper][AAAI2023] Analogical Inference Enhanced Knowledge Graph Embedding☆13Jan 19, 2023Updated 3 years ago
- Ollivier-Ricci Curvature for Hypergraphs: A Unified Framework (ICLR 2023)☆18Jun 14, 2023Updated 2 years ago
- [ICML 2024] Probabilistic Conceptual Explainers (PACE): Trustworthy Conceptual Explanations for Vision Foundation Models☆18Sep 25, 2025Updated 5 months ago
- ☆14Oct 12, 2024Updated last year
- ☆14Mar 26, 2024Updated last year
- [NeurIPS2024] CURE4Rec: A Benchmark for Recommendation Unlearning with Deeper Influence”☆20Jun 14, 2024Updated last year
- ☆23Apr 16, 2024Updated last year
- [ICML 2023] Taxonomy-Structured Domain Adaptation☆12Oct 6, 2023Updated 2 years ago
- ☆13Nov 12, 2021Updated 4 years ago
- GPT lanuage model for dna sequence☆17Nov 20, 2024Updated last year
- 华中科技大学大学CS课程其它报告存档库。组成原理、计算机网络、汇编语言、数据库、操作系统、课程设计以及大数据处理。☆11Jan 16, 2024Updated 2 years ago