brandon-snider/cs336-a1

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/brandon-snider/cs336-a1)

brandon-snider / cs336-a1

Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch

☆34

Alternatives and similar repositories for cs336-a1

Users that are interested in cs336-a1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Brittanywu / Leetcode-Orbit
View on GitHub
☆17Mar 24, 2026Updated 4 months ago
xUhEngwAng / pinyin
View on GitHub
这个仓库包含了我在上人工智能课时完成的拼音输入法作业。
☆11Feb 16, 2022Updated 4 years ago
GoJunHyeong / SpatialBias
View on GitHub
☆10Dec 13, 2022Updated 3 years ago
Justherozen / FlowBench
View on GitHub
[EMNLP 2024] FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents
☆24Jan 6, 2025Updated last year
SEU-ProactiveSecurity-Group / LLM-PD
View on GitHub
Official code for the paper entitled "Toward Intelligent and Secure Cloud: Large Language Model Empowered Proactive Defense"
☆16Apr 10, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
jianuo-huang / Domino
View on GitHub
Official implementation of “Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding”.
☆123Updated this week
warlockee / oxRL
View on GitHub
A lightweight post-training framework for LLMs and VLMs. 51 algorithms, 38 verified models. Scales with DeepSpeed, vLLM, and Ray.
☆19Updated this week
sophgo / sophpi-shaolin
View on GitHub
☆16Feb 7, 2023Updated 3 years ago
yiren-jian / LM-SupCon
View on GitHub
[NAACL 2022] Contrastive Learning for Prompt-based Few-shot Language Learners
☆22Jan 26, 2023Updated 3 years ago
SalesforceAIResearch / ActiveVideoPerception
View on GitHub
Official Code for paper "Active Video Perception: Iterative Evidence Seeking for Agentic Long Video Understanding""
☆18Jun 2, 2026Updated last month
cgarciae / dyn_plot
View on GitHub
☆13Jul 12, 2024Updated 2 years ago
autoload / hexo-theme-auto
View on GitHub
A modern stylish theme for Hexo
☆26Mar 6, 2021Updated 5 years ago
wade3han / normlens
View on GitHub
An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Comm…
☆10May 9, 2024Updated 2 years ago
csitfun / ConTRoL-dataset
View on GitHub
Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts"
☆11Nov 18, 2022Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
rhodesvic / ComputerNetwork-ATopDownApproach
View on GitHub
Computer Network : A Top-Down Approach 8th Resource and Homework
☆15Apr 23, 2021Updated 5 years ago
kernelmachine / demix-data
View on GitHub
Benchmark API for Multidomain Language Modeling
☆25Aug 26, 2022Updated 3 years ago
Andy-Cheng / TEMPURA
View on GitHub
TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of u…
☆27Jun 4, 2025Updated last year
xy-guo / mmdetection_kitti
View on GitHub
2D detection on KITTI dataset. see configs/kitti
☆15Jul 7, 2021Updated 5 years ago
krafton-ai / mini-batch-cl
View on GitHub
☆11Aug 21, 2023Updated 2 years ago
dwlmt / Story-Untangling
View on GitHub
Story understanding and plot analysis pilot.
☆10Dec 27, 2022Updated 3 years ago
yu-gi-oh-leilei / IDA_2023ICLR
View on GitHub
The reproduction of the paper "Robust Attention for Contextual Biased Visual Recognition" ICLR2023.
☆12Feb 23, 2024Updated 2 years ago
Xiang-Deng-DL / CID
View on GitHub
The implementation for "Comprehensive Knowledge Distillation with Causal Intervention".
☆15Mar 12, 2022Updated 4 years ago
duan602728596 / bilibiliLiveCatch
View on GitHub
B站直播流抓取、视频快速剪切工具。
☆21Mar 3, 2018Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
vhrehfdl / bentoml_tutorial
View on GitHub
☆11May 12, 2022Updated 4 years ago
hhaAndroid / mmdetection
View on GitHub
OpenMMLab Detection Toolbox and Benchmark
☆27Mar 4, 2024Updated 2 years ago
Edgis / OneSug
View on GitHub
OneSug
☆29Nov 13, 2025Updated 8 months ago
Data-Intelligence-Lab / DEFT-korean-alpaca
View on GitHub
☆23Oct 30, 2023Updated 2 years ago
juzhengz / logit-fusion
View on GitHub
Learning from Mixed Rollouts: Logit Fusion as a Bridge Between Imitation and Exploration
☆17Feb 24, 2026Updated 5 months ago
farewellthree / Causal-Context-Debiasing
View on GitHub
CCD： Official PyTorch implementation of the paper "Contextual Debiasing for Visual Recognition with Causal Mechanisms"
☆17Jan 26, 2023Updated 3 years ago
kirubarajan / narrative_chains
View on GitHub
An implementation of (Chambers and Jurafsky, 2008), using updated machine learning models, and different training data domains for an ind…
☆14Dec 8, 2022Updated 3 years ago
clankur / einygpt
View on GitHub
a transformer implemented primarily using einops and trained on the tinystories dataset
☆14Jun 21, 2024Updated 2 years ago
kirubarajan / roft
View on GitHub
Real or Fake Text? Evaluation criteria for human-written and computer-generated text through the gamification of annotation. Published in…
☆12Dec 22, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
FreqEdit / FreqEdit
View on GitHub
[CVPR2026] Official implementation of "FreqEdit: Preserving High-Frequency Features for Robust Multi-Turn Image Editing"
☆15Mar 31, 2026Updated 3 months ago
cisnlp / ofa
View on GitHub
[NAACL 2024] A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining
☆18Nov 26, 2023Updated 2 years ago
TsinghuaC3I / Intuitive-Fine-Tuning
View on GitHub
[ACL 2025, Main Conference, Oral] Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process
☆30Aug 2, 2024Updated last year
PlusLabNLP / Narrative-Discourse
View on GitHub
☆16Nov 5, 2024Updated last year
vision-x-nyu / vstat
View on GitHub
Evaluation code for "Benchmarking Visual State Tracking in Multimodal Video Understanding"
☆36Jun 3, 2026Updated last month
sail-sg / odc
View on GitHub
On demand communication
☆34Apr 16, 2026Updated 3 months ago
fudan-generative-vision / MixFlow
View on GitHub
[CVPR 2026] MixFlow Training: Alleviating Exposure Bias with Slowed Interpolation Mixture
☆22Dec 23, 2025Updated 7 months ago